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MEMORY TESTER TESTS MULTIPLE DUT'S 
PER TEST SITE 

REFERENCE TO RELATED APPLICATIONS 

The subject matter of the instant Patent Application is 
related to that disclosed in a pending U.S. Patent Application 
entitled MEMORY TESTER HAS MEMORY SETS CON- 
FIGURABLE FOR USE AS ERROR CATCH RAM, TAG 
RAM's, BUFFER MEMORIES AND STIMULUS LOG 
RAM, Ser. No. 09/672,650 and filed on Sep. 28, 2000. That 
disclosure describes aspects of operations called Address 
Classification and Data Classification that are of interest 
herein. For that reason U.S. patent application Ser. No. 
09/672,650 is hereby expressly incorporated herein by ref- 
erence. 

The subject matter of the instant Patent Application is also 
related to that disclosed in a pending U.S. Patent Application 
entitled METHOD AND APPARATUS FOR 
NO-LATENCY CONDITIONAL BRANCHING, Ser. No. 
09/659,256 and filed on Aug. 28, 2000. That disclosure is 
related to branching in a memory test program that is 
conditioned upon events within one Device Under Test. The 
instant Application extends that to a selected Device Under 
Test from among many such being tested. For that reason 
U.S. patent application Ser. No. 09/659,256 is hereby 
expressly incorporated herein by reference. 

BACKGROUND OF THE INVENTION 

Electronics devices and capabilities have grown 
extremely common in daily life. Along with personal com- 
puters in the home, many individuals carry more than one 
productivity tool for various and sundry purposes. Most 
personal productivity electronic devices include some form 
of non -volatile memory. Cell phones utilize non-volatile 
memory in order to store and retain user programmed phone 
numbers and configurations when the power is turned off. 
PCMCIA cards utilize non-volatile memory to store and 
retain information even when the card is removed from its 
slot in the computer. Many other common electronic devices 
also benefit from the long-term storage capability of non- 
volatile memory in un-powered assemblies. 

Non-volatile memory manufacturers that sell to the elec- 
tronic equipment manufacturers require testers to exercise 
and verify the proper operation of the memories that they 
produce. Due to the volume of non-volatile memories that 
are manufactured and sold at consistently low prices, it is 
very important to minimize the time it takes to test a single 
part. Purchasers of non-volatile memories require memory 
manufacturers to provide high shipment yields because of 
the cost savings associated with the practice of incorporating 
the memory devices into more expensive assemblies with 
minimal or no testing. Accordingly, the memory testing 
process must be sufficiently efficient to identify a large 
percentage of non-conforming parts and preferably all non- 
conforming parts in a single test process. 

As non-volatile memories become larger, denser and 
more complex, the testers must be able to handle the 
increased size and complexity without significantly increas- 
ing the time it takes to test them. Memory tester frequently 
run continuously, and test time is considered a major factor 
in the cost of the final part. As memories evolve and 
improve, the tester must be able to easily accommodate the 
changes made to the device. Another issue specific to testing 
non-volatile memories is that repeated writes to cells of the 
memories can degrade the overall lifetime performance of 
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the part. Non-volatile memory manufacturers have 
responded to many of the testing issues by building special 
test modes into the memory devices. These test modes are 
not used at all by the purchaser of the memory, but may be 

5 accessed by the manufacturer to test all or significant 
portions of the memories in as little time as possible and as 
efficiently as possible. Some non-volatile memories are also 
capable of being repaired during the test process. The tester, 
therefore, should be able to identify: a need for repair; a 

3Q location of the repair; the type of repair needed; and, must 
then be able to perform the appropriate repair. Such a repair 
process requires a tester that is able to detect and isolate a 
specific nonconforming portion of the memory. In order to 
take full advantage of the special test modes as well as the 

35 repair functions, it is beneficial for a tester to be able to 
execute a test program that supports conditional branching 
based upon an expected response from the device. 

From a conceptual perspective, the process of testing 
memories is an algorithmic process. As an example, typical 

20 tests include sequentially incrementing or decrementing 
memory addresses while writing 0's and l's into the 
memory cells. It is customary to refer to a collection of l*s 
and 0's being written or read during a memory cycle as a 
"vector", while the term "pattern" refers to a sequence of 

25 vectors. It is conventional for tests to include writing pat- 
terns into the memory space such as checkerboards, walking 
l's and butterfly patterns. A test developer can more easily 
and efficiently generate a program to create these patterns 
with the aid of algorithmic constructs. A test pattern that is 

30 algorithmically coherent is also easier to debug and use 
logical methods to isolate portions of the pattern that do not 
perform as expected. A test pattern that is generated algo- 
rithmically using instructions and commands that are 
repeated in programming loops consume less space intester 

35 memory. Accordingly, it is desirable to have algorithmic test 
pattern, generation capability in a memory tester. 

Precise signal edge placement and detection is also a 
consideration in the effectiveness of a non-volatile memory 
tester. In order to identify parts that are generally conform- 

40 ing at a median while not conforming within the specified 
margins, a non-volatile memory tester must be able to 
precisely place each signal edge relative in time to another 
signal edge. It is also important to be able to precisely 
measure at which point in time a signal edge is received. 

45 Accordingly, a non-volatile memory tester should have 
sufficient flexibility and control of the timing and placement 
of stimuli and responses from the Device Under Test 
(memory). 

Memory testers are said to generate transmit vectors that 

50 are applied (stimulus) to the DUT (Device Under Test), and 
receive vectors that are expected in return (response). The 
algorithmic logic that generates these vectors can generally 
do so without troubling itself about how a particular bit in a 
vector is to get to or from a particular signal pad in the DUT, 

55 as the memory tester contains mapping arrangements to 
route signals to and from the pins that contact the DUT. The 
collection of the algorithmic pattern generation, threshold 
'setting, signal conditioning and comparison mechanisms, 
and the probes that connect that stuff to the DUT, is called 

60 a test site. In the simple case there is one DUT per test site. 
Memory testers have interior test memory that is used to 
facilitate the test process. This interior test memory may be 
used for several purposes, among which are storing transmit 
vectors ahead of time, as opposed to generating them in real 

65 time, storing expected receive vectors, and storing a variety 
of error indications and other information concerning DUT 
behavior obtained during testing. (There are also housekeep- 
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ing purposes internal to the operation of the memory tester 
that use RAM and that may appear to fall within the purview 
of the phrase "interior memory." These are private to the 
internal operation of the tester, tend to not be visible at the 
algorithmic level, and are comparable to executable instruc- 
tion stores and to internal control registers. That memory is 
described as "interior control memory," and is excluded 
from what is meant herein by the term "interior test 
memory," which we use to describe memory used to store bit 
patterns directly related to the stimulus of, and response 
from, the DUT.) It is easy to appreciate that this interior test 
memory needs to operate at least as fast as the tests being 
performed; a very common paradigm is for the interior test 
memory (or some portion thereof) to be addressed by the 
same address (or some derivative thereof) as is applied to the 
DUT. What is then stored at that addressed location in 
interior test memory is something indicative of DUT behav- 
ior during a test operation performed on the DUT at that 
address. Algorithmic considerations within the test program 
may mean that the sequence of addresses associated with 
consecutive transmit vectors can be arbitrary. Thus, the 
interior memory needs to have the dual attributes of high 
speed and random addressability. SRAM comes to mind 
immediately as being fast, easy to control and tolerant of 
totally random addressing. Indeed, conventional memory 
testers have used SRAM as their interior test memory. 

Unfortunately, SRAM is quite expensive, and this has 
limited the amount of interior test memory with which 
memory testers have had to work. The result is limits on 
memory tester functionality that are imposed by a shortage 
of memory. DRAM is significantly less expensive, but 
cannot tolerate random addressing and still perform at high 
speed. 

DRAM can replace SRAM as the interior test memory in 
a memory tester. As briefly described below, the problem of 
increasing the speed of DRAM operation for use as interior 
test memory can be solved by increasing the amount of 
DRAM used, in place of increasing its speed. Numbers of 
identical Banks of DRAM are treated as Groups. A combi- 
nation of interleaving signals for different Banks of memory 
in a Group thereof and multiplexing between those Groups 
of Banks slows the memory traffic for any one Bank down 
to a rate that can be handled by the Bank. 

At the top level of interior test memory organization there 
are four Memory Sets, each having its own separate and 
independent address space and performing requested 
memory transactions. Two are of DRAM as described 
above, and two are of SRAM. Each Memory Set has its own 
controller to which memory transactions are directed. As to 
externally visible operational capabilities as memories, all 
four Memory Sets are essentially identical. They differ only 
in their size of memory space and how they are internally 
implemented: The SRAM Memory Sets do not employ 
multiplexing and interleaving, since they are fast enough to 
begin with. Despite their independence, Memory Sets of the 
same type (of SRAM or of DRAM) may be "stacked," 
which is to say treated a one larger address space. 

Thus it is that the interior test memory of the tester is 
divided into four Memory Sets, two of which are "internal" 
SRAM's and two of which are "external" DRAM's. To be 
sure, all this memory is physically inside the memory tester; 
the terms "internal" and "external" have more to do with a 
level of integration. The S RAM's are integral parts of a 
VLSI (Very Large Scale Integration) circuit associated with 
the tester's central functional circuitry, while the DRAM's 
are individual packaged parts mounted adjacent the VLSI 
stuff. The amount of SRAM is fairly small, (say, around a 
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megabit per Memory Set) while the amount of DRAM is 
substantial and selectable (say, in the range of 128 to 1024 
megabits per Memory Set). The SRAM Memory Sets are 
always present, and may be used for any suitable purpose, 

5 such as storing the expected content of a DUT that is a ROM 
(Read Only Memory). The DRAM Memory Sets, although 
actually optional, are typically used for creating a trace for 
subsequent analysis leading to repair, although there are also 
other uses. The tester does not enforce major distinctions 

10 between the SRAM and DRAM Memory Sets, as to different 
purposes for which they may be used. There are some 
practical major distinctions that arise mostly as a matter of 
size. The SRAM Memory Sets are small, while the DRAM 
Memory Sets are potentially huge. The person or persons 

15 creating the test programming generally make the decisions 
concerning how the various Memory Sets are to be used. 
There are also a few minor distinctions where a particular 
operational feature of the memory tester requires the use of 
a specific Memory Set. These cases usually arise out of 

20 economic or performance considerations that require a dedi- 
cated hardware path to a Memory Set. It is expedient to 
simply pick a likely one, and let it go at that. 

The advent of substantial amounts of interior test memory 
(in the form of the DRAM Memory Sets) raises the issue of 

25 how this additional amount of memory can be used to 
facilitate the operation of desirable features within the 
memory tester. In the tester of interest the interior test 
memory subsystem is extremely flexible, in that despite 
having a native word width of thirty-two bits, the effective 

30 word width can be any power of two (up to 2 5 »32), with a 
corresponding increase in address space for narrower words. 
There is an extensive address mapping capability, both for 
addressing DUT's and for addressing interior test memory, 
substantial data classification and address classification 

35 mechanisms that facilitate multiple Tag RAM's and other 
error analysis tools, all of which are made more practical by 
having lots of interior test memory. Moreover, these 
enhancements made possible by more memory do not exist 
in a vacuum; they are very valuable in the testing of certain 

40 types of memory parts. 

Despite that recent advances have produced memory parts 
of truly enormous capacity (512MB) and wide data paths 
(thirty-two bits), yesterday's four, eight and sixteen bit parts 
are still solidly in commercial service. It is even the case that 

45 some high capacity parts have been "throttled down" to a 
narrow path for address and data, even at the expense of 
serialization or of supporting multiple cycle segmentation to 
transport whole data of a wider native word width. There are 
various reasons for this situation: either small parts are all 

50 that is needed, or, the application is such that big parts can 
have narrow paths (video applications that are always 
addressed sequentially). Suffice it to say that there are good 
economic reasons why memory parts with narrow paths are 
well received in the market. And that means that they have 

55 to be tested. 

Consider what this means to a purveyor of a top of the line 
memory tester that boasts of being able to test parts requiring 
up to sixty-four channels. From time to time his customer 
wants to check eight bit parts. What with supplies and 

60 ground, I/O bus, clocks and assorted control lines, there are 
perhaps from twelve to sixteen channels that will be needed. 
Suppose it is fourteen. Does this mean that fifty channels 
will be left unused when the part to be tested is located under 
the test head? Considering that the memory industry con- 

65 siders time on the tester as money, and that memory tester 
often run twenty-four hours a day, seven days a week, our 
purveyor and his customer are both interested in ways that 
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make the memory tester more competitive. To the customer, 
unused test capacity is wasted money. What the designers of 
the memory tester can do is increase the flexibility and 
configurability of their machine to allow it to test more parts 
at once. This defuses the need for the customer to purchase 
many different models, and even though his per tester cost 
remains at the top of the line level, the customer gets a fair 
value in terms of increased throughput, longer life of the 
investment, etc. 

The tester's designers note that sixty-four channels is 
enough to do as many as sixteen channels four times, and 
readily reach the conclusion that a combination of four 
undiced DUTs on a wafer, or four packaged parts, can be 
construed as some strange single big part that just happens 
to look exactly like the union of four little parts. There is no 
question about it: in theory, those four little parts can be 
tested at the same time, simultaneously, under the same test 
head. The trouble is that it is awkward and inconvenient. The 
test program has to be re-written, which is so disgusting that 
it is not considered a viable solution. This is especially so for 
flash memory parts, since a failure in one DUT can expose 
other DUTs being tested under the same test head to 
undesirable, or even harmful, stresses having to do with 
excessive numbers of write cycles or repeated high drive 
levels, brought about by programmed-in escalation triggered 
by marginal or failed behavior of one DUT. But the design- 
ers of test programs want to avoid punishing the innocent 
along with the guilty, so to speak. (The class of so-called 
"NAND" parts is especially sensitive to this sort of thing.) 
The writing of a single-thread-of-control test program that 
emulates a four-threads-of-control test program, where each 
of those four threads (what is done for an individual DUT) 
has conditional branching and other result dependent behav- 
ior is, well, a nightmare. That sort of situation would seem 
to need four independent execution mechanisms with four 
simple programs whose internal execution paths could inde- 
pendently diverge as needed. But then, how to return to the 
other extreme, where all sixty-four channels need to be 
controlled by one program testing an actual single big part? 
Four little processors do not make a big processor! 

There has to be a better way than putting four big 
processors, or three little ones and one big one, behind the 
test head. Especially so if test sites are also to be capable of 
being bonded together to increase their size. 

Some types of memory devices require that a write to a 
location be done many times in order to "program" that 
location with the written value. There is also an issue called 
"overprogramming" that refers to needlessly continuing to 
write the value to be programed at a location after it has been 
successfully programmed. Such continued writing 
(overprogramming) can damage the part. There are various 
techniques that have been used to prevent overprogram- 
ming. Since these programming activities usually take place 
within tightly written loops embedded in other functions, 
any extraneous activity in the interior loops is inefficient. It 
would therefore be desirable if the test program overhead 
needed to program a location and prevent overprogramming 
could be significantly reduced. 

Thus, the situation is this. We have a powerful tester that 
has an abundance of channels per test site, with powerful 
algorithmic capabilities to generate patterns and perform 
failure analysis, and that can attempt repairs by blowing 
fusible elements in the DUT to cause substitute circuits to 
replace bad ones. The tester has lots of memory that is easily 
partitioned, reconfigured and variously mapped, and as such 
lends itself to maintaining nicely separated results for mul- 
tiple DUTs, if we could just test them all at the same time 
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on the same test site. But the size of the algorithmic control 
mechanism of the test site is determined by the maximum 
number of channels, and does not readily decompose into 
usable smaller mechanisms. And yet there are all these small 

5 parts out there to be tested, and time on the tester is money. 
What to do? 

SUMMARY OF THE INVENTION 
The internal architecture of a memory tester is enhanced 

io to support the testing of multiple DUTs of the same type at 
a test site, while requiring only minor modifications to a test 
program that would ordinarily be used to test a single DUT. 
The multiple DUTs are electrically isolated from one 
another, but are, at least at the outset, given the same 

15 stimulus from which the same response is expected. To do 
this the DUTs are each associated with collections of 
channels that are to be used to test their respective DUTs. 
The tester can be instructed to replicate the segments of the 
test vectors needed to test one DUT on the channels for the 

20 other DUTs. This produces patterns of (sequences of) 
transmit (stimulus) and receive (response) vectors that are 
"n-many DUTs wide," as it were. Conditional branching 
within the test program in response to conditions in the 
receive vectors (DUT failure) is supported by recognizing 

25 several types of error indications and an ability to selectively 
disable the testing of one or more DUTs while continuing 
to test the one or more that are not disabled. The error 
indications include per channel functional error flags and per 
DUT functional error flags, as well as per DUT parametric 

30 error flags. The per DUT functional error flags are created 
from an OR'ing of the per channel functional error flags 
done in accordance with the DUT to channel association. 
These flags are reported back to the test program, which can 
then take various appropriate actions that alter program flow 

35 to invoke special testing or other actions for the suspect 
DUT. The error conditions are also detected by pre- 
programmed mechanisms within the circuitry that applies 
transmit vectors and evaluates receive vectors. These latter 
pre-programmed mechanisms produce actions that include, 

40 on a per channel basis, the ability to alter the drive and 
receive formats for the DUT signal of that channel, and the 
ability to alter a data value associated with the DUT signal 
of that channel. Also included are ways to remove or limit 
stimulus to particular DUTs, as well as ways to make all 

45 comparisons for a particular DUT appear to be "good," 
regardless of the true facts. These latter mechanisms remove 
the need for multiple threads of execution in the test pro- 
gram. 

The test program is thus empowered to simultaneously 

50 test a plurality (up to four in a preferred embodiment) of 
DUTs "in parallel" as long as no errors are reported. Upon 
an error the test program can branch to an error investigation 
routine that exercises only the suspect DUT by temporarily 
disabling the good DUTs while continuing to exercise the 

55 bad DUT. If the bad DUT is defective beyond repair, or is 
unsafe to leave powered up, it can be disabled, the other 
DUTs re-enabled and regular testing resumed. In this way, 
a single thread of execution (that would exist in essentially 
this same form for testing a single DUT) can be selectively 

60 switched between (executed on behalf of) the different 
DUTs on an as needed basis, driven by which one(s) fail(s). 
During these selectively switched intervals simultaneous 
testing of all DUTs is suspended in favor of a digression 
that tests a particular DUT of interest. To be sure, this 

65 switching and jumping to execute digressions is also pro- 
grammatically defined to occur in response to contingent 
events discovered while testing. Part of this programmatic 
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definition is easily performed modifications to the single 
threaded test program (which remains single threaded) and 
part of it is pre-configuration of various hardware assist 
mechanisms. 

These features may be combined with automatic reading 5 
of a special bad block table created in interior test memory 
to facilitate the testing of memory parts that have an internal 
block structure, by automatically disabling, and removing 
from further influence on the test program, actions related to 
a bad block. That bad block may or may not be in a DUT that 10 
is being tested in a multi-DUT fashion. These features may 
also be used to prevent overprogramming. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG, 1 is a simplified block diagram of an extensively 15 
reconfigurable non-volatile memory tester constructed in 
accordance with the invention; 

FIG. 2 is a simplified block diagram expansion of the 
DUT tester 6 of FIG. 1; 20 

FIG. 3 is a simplified functional block diagram of the 
interior test memory mechanism that appears in the block 
diagram of FIG. 2; 

FIG. 4 is a block diagram of a DUT Disable Logic portion 
of the block diagram of FIG. 2; 25 

FIG. 5 is a simplified block diagram, most of which is of 
a portion of the TIMING/FORMATTING & COMPARI- 
SON circuit of FIG. 2, that describes how individual chan- 
nels are automatically disabled during multi-DUT testing; 

FIG. 6 is a block diagram of logic circuitry located within 
the Next Address Calculator of the Micro-Controller 
Sequencer of FIG. 2 and that facilitates test program branch- 
ing on conditions associated with different DUT's during 
multi-DUT testing; and 35 

FIG. 7 is a simplified diagram illustrating a technique for 
automatically disabling channels associated with a bad block 
in a memory DUT having a block structure. 

DESCRIPTION OF A PREFERRED 
EMBODIMENT 

Refer now to FIG. 1, wherein is shown a simplified block 
diagram 1 of a Non-Volatile Memory Test System con- 
structed in accordance with the principles of the invention. 
In particular, the system shown can simultaneously test, with 45 
as many as sixty-four test points each, up to thirty-six 
individual DUT's (Devices Under Test) at one time, with 
provisions for reconfiguration to allow elements of a col- 
lection of test resources to be bonded together to test DUT's 
having more than sixty-four test points. These test points 50 
may be locations on a portion of an integrated circuit wafer 
that has not yet been diced and packaged, or they might be 
the pins of a packaged part. The term "test point" refers to 
an electrical location where a signal may be applied (e.g., 
power supplies, clocks, data inputs) or where a signal can be 55 
measured (e.g., a data output). We shall follow the industry 
custom of referring to the test points as "channels". The 
"collection of test resources to be bonded together" referred 
to above may be understood as being as many as thirty-six 
test sites, where each test site includes a Test Site Controller 60 
(4), a (sixty-four channel) DUT Tester (6) and a (sixty-four 
channel) collection of Pin Electronics (9) that makes actual 
electrical connection to a DUT (14). In the case where 
testing the DUT requires sixty-four or fewer channels, a 
single Test Site is sufficient to perform tests upon that DUT, 65 
and we say, for example, that the Test Site #1 (as it appears 
in FIG. 1) forms or operates as a "Single Site Test Station". 
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On the other hand, when some form of the aforementioned 
reconfiguration is in effect, two (or more) Test Sites are 
"bonded" together to function as one larger equivalent Test 
Site having one hundred and twenty-eight channels. 
Accordingly, and again in reference to an example shown in 
FIG. 1, we say that Test Sites #35 and #36 form a "two-Site 
Test Station". 

To briefly consider an opposing case, one should not 
assume that an entire Test Site is needed to test a single DUT, 
or that a single Test Site can test but a single DUT. Suppose 
that a wafer had two, three or four (probably, but not 
necessarily, adjacent) dies, the sum of whose test channel 
requirements were sixty-four channels or less. Such DUT's 
(ISa-d) can be tested simultaneous by a single Test Site 
(e.g., Test Site #2 as shown in FIG. 2). What makes this 
possible is the general purpose programmability of each Test 
Site, as augmented by certain hardware features to be 
described in due course. In principle, a test program 
executed by the Test Site could be written such that one part 
of the Test Site's resources is used to test one of the DUT's 
while another part is used to test the other DUT. After all, we 
would assume that if we had a third DUT that were the 
logical union of the first two, then we would be able to test 
that third DUT with a single Test Site, so we ought to be able 
to similarly test its "component DUT's", as it were. A major 
difference is, of course, individually keeping track of which 
of the two "component DUT's" pass or fail, as opposed to 
a simple unified answer for the "third" DUT. That is, there 
is an issue concerning what portion of the "third" DUT 
failed. There are other issues as well, including removing or 
limiting the drive signals to a bad DUT, branching within the 
test program based on which DUT indicates failure, while at 
the same time preventing the test program from becoming 
hopelessly multi-threaded. Certain simple aspects of this 
"Multi-DUT Test Station" capability at a single Test Site are 
fairly simple, while others are complex, and all will be 
explained in due course. Multi-DUT testing should not be 
confused with the notion of bonding two or more Test Sites 
together. 

Were it not for this notion of Test Site reconfiguration 
there would be no difference between a Test Site and a Test 
Station, and we would dispense with one of the terms. As it 
is, however, it will be readily appreciated that the number of 
Test Stations need not equal the number of Test Sites. In the 
past, the numbers could be different because Test Sites were 
sometimes split to create more Test Stations for simple 
Multi-DUT testing (DUT's not complex enough to consume 
an entire Test Site). Now, however, the difference may also 
be due to Test Sites having been bonded together to form 
multi-site Test Stations (DUT's too complex for a single Test 
Site). 

To continue, then, a Test System Controller 2 is connected 
by a System Bus 3 to as many as thirty-six Test Site 
Controllers whose names end in the suffixes #1 through #36 
(4a-4z). (It is true that subscripts a-z only go from one to 
twenty-six, and not to thirty-six. But this minor deception 
seems preferable over numerical subscripts on numerical 
reference characters, which would be potentially very 
confusing.) The Test System Controller 2 is a computer (e.g., 
a PC running NT) executing a suitable Test System Control 
Program pertaining to the task of testing non-volatile memo- 
ries. The Test System Control Program represents the high- 
est level of abstraction in a hierarchical division of labor 
(and of complexity) for accomplishing the desired testing. 
The Test System Controller determines which programs are 
being run by the different Test Sites, as well as overseeing 
a robotics system (not shown) that moves the test probes and 
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DUTs as needed. Test System Controller 2 may function in 
ways that support the notion that some Test Sites are 
programmed to perform as single-site Test Stations, while 
others are bonded together to form multi-site Test Stations. 
Clearly, in such circumstances there are different parts being 5 
tested, and it is most desirable that different tests be used for 
the different parts. Likewise, there is no requirement that all 
single-site Test Stations be testing the same style of part, nor 
is there any such requirement for multi-site Test Stations. 
Accordingly, the Test System Controller 2 is programmed to 10 
issue the commands to accomplish the needed Test Site 
bonding and then to invoke the appropriate test programs for 
the various Test Stations in use. The Test System Controller 
2 also receives information about results obtained from the 
tests, so that it may take the appropriate action for discarding ^ 
the bad part and so that it may maintain logs for the various 
analyses that may be used to control, say, production pro- 
cesses in a factory setting. 

The Test System itself is a fairly large and complex 
system, and it is common for it to use a robotics subsystem 20 
to load wafers onto a stage that then sequentially positions 
one or more future dies under probes connected to the Pin 
Electronics 9, whereupon those future dies (the wafer has 
not yet been diced) are tested. The Test System can also be 
used to test packaged parts that have been loaded onto a 25 
suitable carrier. There will be (as is explained below), at 
least one Test Site Controller associated with each Test 
Station in use, regardless of how many Test Sites are used to 
form that Test Station, or of how many Test Stations are on 
a Test Site. A Test Site Controller is an embedded system that 30 
may be an i960 processor from Intel with thirty-six to 
sixty-four MB of combined program and data memory 
running a proprietary operating system called VOS 
(VersaTest O/S), which was also used in earlier products for 
testing non-volatile memories (e.g., the Agilent VI 300 or 35 
V3300). For the moment, we shall consider only the situa- 
tion for single-site Test Stations. For the sake of a definite 
example, suppose that Test Site #1 is functioning as Test 
Station #1, and that it is to test the WHIZCO part no. 0013. 
The test regimen involves a hundred or so different types of 40 
tests (varying and monitoring voltage levels, pulse widths, 
edge positions, delays, as well as a large dose of simply 
storing and then retrieving selected patterns of information), 
and each type of test involves many millions of individual 
memory cycles for the DUT. At the highest level, the 45 
operators of the Test System instruct the Test System Con- 
troller 2 to use Test Station #1 to begin testing WHIZCO 
0013's. In due course the Test System Controller 2 tells Test 
Site Controller #1 (4a) (which is an embedded [computer] 
system) to run the associated test program, say, TEST_ 50 
WHIZ_13. If that program is already available within Test 
Site Controller #l's environment, then it is simply executed. 
If not, then it is supplied by the Test System Controller 2. 

Now, in principle, the program TEST_WHIZ_13 could 
be entirely self-contained. But if it were, then it would 55 
almost certainly be rather large, and it may be difficult for 
the processor of the embedded system within the Test Site 
Controller 4a to run fast enough to produce the tests at the 
desired speed, or even at a rate that is uniform from one 
DUT memory cycle to the next. Accordingly, low level 60 
subroutine type activities that generate sequences of address 
and associated data that is to be written or is expected from 
a read operation, are generated as needed by a program- 
mable algorithmic mechanism located in the DUT Tester 6, 
but that operates in synchrony with the program being 65 
executed by the embedded system in the Test Site Controller 
4. Think of this as exporting certain low level subroutine- 
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like activity and the task of initiating DUT memory cycles 
out to a mechanism (the DUT Tester) that is closer to the 
hardware environment of the DUT 14, Generally speaking, 
then, whenever the Test System Controller 2 equips a Test 
Site Controller with a test program it also supplies the 
associated DUT Tester with appropriate low level imple- 
mentation routines (perhaps specific to the memory being 
tested) needed to accomplish the overall activity described 
or needed by the programming for the Test Site Controller. 
The low level implementation routines are termed 
"patterns", and they are generally named (Oust as functions 
and variables in high level programming languages have 
names). 

Each Test Site Controller #n (4) is coupled to its associ- 
ated DUT Tester #n (6) by a Site Test Bus #n (5). The Test 
Site Controller uses the Site Test Bus 5 to both control the 
operation of the DUT Tester and receive therefrom infor- 
mation about test outcomes. The DUT Tester 6 is capable of 
generating at high speed the various DUT memory cycles 
that are involved in the test regimen, and it decides if the 
results of a Read memory cycle are as expected. In essence, 
it responds to commands or operation codes ("named 
patterns") sent from the Test Site Controller by initiating 
corresponding useful sequences of Read and Write DUT 
memory cycles (i.e., it executes the corresponding patterns). 
Conceptually, the output of the DUT Tester 6 is stimulus 
information that is to be applied to the DUT, and it also 
accepts response information therefrom. This stimulus/ 
response information la passes between the DUT Tester 6a 
and a Pin Electronics #1 assembly 9a. The Pin Electronics 
assembly 9a supports up to sixty-four probes that can be 
applied to the DUT 14. 

The above-mentioned stimulus information is just a 
sequence of parallel bit patterns (i.e., a sequence of "transmit 
vectors" and expected "receive vectors") expressed accord- 
ing to the voltage levels of some family of logic devices used 
in the DUT Tester. There is a configurable mapping between 
bit positions within a stimulus/response and the probes on 
the die, and this mapping is understood by the DUT Tester 
6. The individual bits are correct as to their timing and edge 
placement, but in addition to the mapping they may also 
need voltage level shifting before they can be applied to the 
DUT. Likewise, a response that originates in the DUT 
subsequent to a stimulus may need buffering and (reverse) 
level shifting before it can be considered suitable for being 
fed back to the DUT Tester. These level shifting tasks are the 
province of the Pin Electronics 9a. The Pin Electronics 
configuration needed for testing a WHIZCO 0013 likely will 
not work for testing a part from the ACME Co., and perhaps 
not even with another WHIZ Co, part. So, it will be 
appreciated that the Pin Electronics assembly needs to be 
configurable also; such configurability is the function of the 
PE Config lines &a. 

The above concludes a brief architectural overview of 
how a single Test Site is structured for testing a DUT. We 
turn now to issues that arise when there are many Test Sites 
with which to operate. As a preliminary, we shall describe a 
preferred embodiment for constructing a Test System having 
multiple Test Sites. In many respects, some of the informa- 
tion we are about to describe are matters of choice based on 
market studies of customer preference and cost benefit 
analyses. Be that as it may, to build one of these things one 
has to make definite choices, and once that is done there are 
particular consequences that are visible throughout the entire 
system. It is felt that it is useful to describe, at least in a 
general way, the larger outlines of the hardware properties of 
the Test System. Even though some of these properties are 
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contingent, a knowledge of them will nevertheless assist in 
an appreciation of various examples used to illustrate the 
invention. 

To begin, then, consider four rather large card cages. Each 
card cage has, besides power supplies and water cooling 5 
(fans can be a source of contamination in a clean room 
environment), a mother board, a front plane and a back 
plane. Into each card cage can be placed up to nine assem- 
blies. Each assembly includes a Test Site Controller, DUT 
Tester and Pin Electronics. We shall be describing the 10 
general outlines of how Test Site Controllers are bonded 
together, which will involve some busses used to create 
daisy chains. 

A brief digression concerning the term "daisy chain" is 
perhaps in order. Consider system elements A, B, C and D. 15 
Suppose that they are to be daisy chained together in that 
order. We could say that there is an information or control 
path that leaves A and goes into B, that B can selectively 
pass on traffic that then leaves B and goes into C, and that 
C can selectively pass on traffic that then goes into D. These 20 
same kind of arrangements can exist for traffic in the other 
direction, too. Daisy chains are often used to create priority 
schemes; we shall use them to create master/slave relation- 
ships between various the Test Site Controllers. We shall 
denote these daisy chained style communication arrange- 25 
ments with the suffix noun "DSY", instead of "BUS". Thus, 
we might refer to a Command/Data DSY instead of a 
Command/Data Bus. Now, the notion that information 
"enters B and is selectively passed on" may suggest that 
traffic is replicated onto a separate set of conductors before 30 
being passed on. It could be that way, but for performance 
reasons it is more like a regular bus having addressable 
entities. By means of a programmable address mapping 
arrangement and the ability to put portions of downstream 
Test Site Controllers "to sleep," the single bus can be made 35 
to logically appear (i.e., to function) as a plurality of daisy 
chains. Finally, it will be appreciated that the daisy chains 
are high performance pathways for command and control 
information, and that if they were not, then we could not 
expect a master/slave combination (multi-site Test Station) 40 
to operate as fast as a single Test Site does. For the benefit 
of daisy chain performance, the various DSY do not leave 
their respective card cages. The effect of this decision is to 
place some limits on which Test Sites (and thus also how 
many) can be bonded together. In principle, there is no 45 
fundamental need for this limitation, nor is there a genuine 
lack of technical practicality involved (it could be done); it 
is simply felt that, since there are already nine Test Sites in 
a card cage, extending the DSY's adds significant cost for 
relatively little additional benefit. 50 

To resume our discussion of FIG. 1, then, consider the 
various Test Site Controllers 4a~4z that can populate the 
four card cages, each with nine Test Site Controllers. Let's 
denote them as 4a-4f, 4g-4w, 4«-4/ and 4w-4z. (Never 
minding, as explained earlier, that these are nominally only 55 
twenty-six subscripts — the reader is invited to imagine that 
there are another ten subscript symbols in there, someplace.) 
A CMD/DAT DSY 17a (Command & Data Daisy Chain) 
interconnects the Test Site Controllers 4a-4f that are in one 
card cage, while a different CMD/DAT DSY 17b intercon- 60 
nects the Test Site Controllers 4g-4m in another card cage. 
The same arrangement exists for the remaining card cages, 
and Test Site Controllers 4n-At and 4«-4z, respectively. We 
have earlier said that the DSY do not leave the card cages, 
in that "tail end" of a bus that actually forms the DSY does 65 
not leave a card cage and become the head of the next 
segment in another card cage. Instead, the System Bus 3 
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from the Test System Controller 2 goes to all Test Site 
Controllers, and each is capable of becoming a Master at the 
head of a DSY segment that does not leave the card cage. 

The CMD/DAT DSY \7a-d that we have been discussing 
exist between the various Test Site Controllers 4a-4z. There 
is a similar arrangement for the SYNC/ERR DSY 18a-18^ 
and the DUT Testers 6a-6z. The synchronization and error 
information conveyed by the SYNC/ERR DSY 18 allows 
DUT Testers to function in unison. These two daisy chains 
(17 and 18) carry slightly different types of information, but 
each exists as part of the same general mechanism for 
bonding one or more Test Sites together into a Test Station. 

We turn now to a discussion of FIG. 2, which is a 
simplified block diagram expansion of the DUT tester 6 of 
FIG. 1, of which there may be as many as thirty-six. It is 
sufficient at present to describe only one instance thereof. A 
glance at FIG. 2 will show that it is a fairly well populated 
with stuff; especially so for a "simplified" block diagram. 
Some of what is in the DUT Tester 6 and represented in the 
block diagram is functionally quite complicated, and is not 
available in "off the shelf form. It is appropriate here to 
make two points. First, the primary purpose of including 
FIG. 2 is to describe the basic properties of an important 
operational environment within the overall Non- Volatile 
Memory Test System 1. The inventions) that are fully 
described in connection with FIG. 3 and subsequent figures 
will either be expansions of mechanisms set out in the 
following description of FIG. 2, or they will be new mecha- 
nisms whose motivational premise is found in FIG. 2. Either 
way, as this is written it is not known exactly which of these 
is before the reader. The goal at present is to provide a 
simplified yet informative starting point for numerous dif- 
ferent Detailed Descriptions of various Preferred 
Embodiments, so that each of those can be as concise as is 
appropriate (as opposed to one "jumbo" Specification that 
discloses everything about each different invention). The 
second point is that the expanded or extended material, 
while in general overall agreement with FIG. 2, may contain 
information that does not "match-up" exactly with the 
simplified version. This does not mean there has been an 
error, or that things are fatally inconsistent; it arises because 
it is sometimes difficult or impossible to simplify something 
such that it is the exact image in miniature. The situation is 
rather like maps. A standard size road map of Colorado will 
show that when going east on 1-70 you can go north on 1-25 
at Denver. It looks like a left turn. And while it did used to 
be an actual left turn, it isn't one now, and a detailed map of 
that intersection will show a sequence of component turns 
and intervening road sections. But no one would say that the 
standard size road map is wrong; it is correct for its level of 
abstraction. Similarly, and despite its fairly busy appearance, 
FIG. 2 is indeed a simplification operating at a medium level 
of abstraction, but some seeming left turns are not simple 
left turns at all. 

As is shown in FIG. 1, the major input to the DUT Tester 
6 is an instance of the Test Site Bus 5, which originates from 
a Test Site Controller 4 that is associated with the instance 
of the DUT Tester 6 that is of interest. The Test Site Bus 5 
is coupled to a Multi-Bus Controller 88 that converts traffic 
on the Test Site Bus to traffic on a Ring Bus 85 or a VT Bus 
89. Ring Bus traffic can also converted to VT Bus traffic, and 
vice versa. Almost everything in FIG. 2 is part of some large 
scale integrated circuit; the Timing/Formatting & Compari- 
son circuit 52 (described below) is actually eight such IC's, 
although we show it as one entity for the sake of brevity. 
Save for the various Ext. DRAM's, most of the rest of the 
stuff in FIG. 2 is part of another large IC called the APG 
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(Automatic Pattern Generator). The Ring Bus 85 is a general some programmatically specified number of iterations or 
purpose inter-mechanism communication path for configur- other numerical condition. These PROGRAM CONTROL 
ing the major elements within the APG portion of the DUT FLAGS 25 are sent back to the Micro-Controller Sequencer 
Tester 6, and for setting modes of operation, etc. There also 19, where they affect the value of the instruction fetch 
various dedicated very wide and high speed paths between 5 address (created by Next Address Calculator 102) in ways 
various elements of the APG. The VT Bus 89 is an I C to IC familiar to those who understand about micro programmed 
bus for use within the DUT Tester itself. execution mechanisms. There are also various OTHER 
The Ring Bus 85 is the mechanism by which the Test Site FLAGS 55 that also can be used to effect program branch- 
Controller communicates with the APG portion of the DUT ing. These originate with various ones of the other mecha- 
tester 6. The Ring Bus 85 is coupled to a Micro-Controller 10 nisms within the DUT Tester 6 that are controlled by the 
Sequencer 19, which may be likened to a special purpose different fields of the fetched instruction word. One specific 
microprocessor. Using an address created by a Next Address additional flag is expressly shown as a separate item: VEC_ 
Calculator 102, it fetches instructions from a program stored FIFO_FULL 26. In another drawing having somewhat less 
in a program memory, which may be either internal to the detail it might be lumped in along with the OTHER FLAGS 
Micro-Controller Sequencer 19 (PGM SRAM 20) or exter- 15 55. We have separated it out to assist in explaining one 
nal thereto (EXT. DRAM 21). Although these two memories aspect of the operation of the Micro-Controller Sequencer 
appear to be addressed by what is essentially a logically 19. 

common address 63 that serves as a program counter (or, what VEC_FIFO_FULL does is to (temporarily) halt 

instruction fetch address), and either can be a source of as further program execution by the Micro-Controller 

programming to be executed, note that: (1) Only one of the 20 Sequencer 19. There are many stages of pipeline between 

memories performs instruction fetch memory cycles during tne instructions fetched by the Micro-Controller Sequencer 

any period of time; and (2) In fact they are addressed by 19 and tne mec hanism that finally hands test vectors off to 

electrically different signals. The SRAM is fast and allows be applied to the DUT In addition, part of the baggage that 

genuine random access, but consumes valuable space within accompanies a vector as it moves toward being applied to 

the Micro-Sequence Controller 19 (which is part of the large 25 tne DUT is information concerning the rate of eventual 

APG IC), so its size is limited. The external DRAM can be vector application, or, each vector's duration. Thus, the rate 

provided in adjustable amounts of considerable quantity, but G f vector application to the DUT need not be constant, and 

is fast only when accessed in sequential chunks involving m particular, a Group of vectors may take longer to apply 

linear execution and no branching. Programming in the than mey did l0 generate. The Micro-Controller Sequencer 

SRAM 20 is most often that which is intensely algorithmic, 30 s j mp i y executes programming at its maximum rate. But 

while the EXT. DRAM 21 is best suited for material not clearly, on average, the rate of "vector consumption," as it 

readily generated by algorithmic processes, such as initial- were) mU st equal the rate of Sector production," lest the 

ization routines and random or irregular data. pipeline need to be elastic nearly without limit. There is a 

The Next Address Calculator 102 can implement branch- Vector FIFO 45 at the output of the Address Mapper 29 

ing in the test program being executed, in response to 35 discussed below, and it serves as an elastic capacity in the 

unconditional jump either instructions or conditional jump pipeline. The signal VEC_FIFO_FULL is used to prevent 

or conditional subroutine instructions conditioned on vari- overrunning the limited number of stages in the pipeline, by 

ous PROGRAM CONTROL FLAGS (25), OTHER FLAGS causing a temporary cessation in the production of new 

(55), and certain other signals that, for clarity are shown vectors at the head end of the pipe. 

separately (DFE 0:3 103 and DPE 0:3 104) and which will 40 To continue, the (three times sixteen equals forty-eight 

be described in due course. bits of) x Y and Z address components 27 are applied to an 

The instruction word executed by the Micro-Controller Address Mapper 29, whose output is a selected-in-advance 

Sequencer 19 is fairly wide: two hundred and eight bits. It nearly arbitrary rearrangement of the address values in the 

consists of thirteen sixteen-bit fields. These fields often ordered forty-eight bit address space. Asa point of departure 

represent fetched instruction information for mechanisms 45 for appreciating this, suppose for a moment that the Address 

that are outside the Micro-Controller Sequencer proper. Mapper 29 were a memory that fully populated a forty-eight 

Such fields are dedicated to their associated mechanisms. bit address space, and that it held a forty-eight bit value at 

One set of ALU INSTRUCTIONS 22 are applied to a each address. (Temporarily never mind that such a memory 

collection of eight sixteen-bit ALU's 24, while others are would — today anyway — be size of a large refrigerator.) 

disbursed to various other mechanisms distributed through- 50 Given such a memory, a look-up table could be implemented 

out the DUT Tester. This latter situation is represented by the that could map any applied address into another, arbitrarily 

lines and legend "VARIOUS CONTROL VALUES & selected, forty-eight bit value which could then be used as a 

INSTRUCTIONS" 42. replacement address. The reason that such address mapping 

The eight sixteen-bit ALU's (24) each have a conven- is desirable is that the X, Y and Z address components 

tional repertoire of arithmetic instructions built around asso- 55 generally have useful meaning in the context of a particular 

ciated sixteen-bit result registers (each ALU has several DUT's internal architecture, which is most likely not imple- 

other registers, too). Three of these result registers and their mented with one big linear decoder. The notions of rows, 

associated ALU's are for generating X, Y and Z address columns and layers, block or pages may be very useful to the 

components 27 that are variously combined into a complete Test Engineer, and failures that occur in locations that are 

address to supplied to the DUT. Two more of the eight 60 physically close together may involve corresponding close - 

ALU/registers (DH & DL) are provided to assist in the ness in their X, Y and Z addresses. Such patterns in the test 

algorithmic creation of thirty-two bit data patterns 28 that results can be valuable in appreciating what is wrong and in 

are divided between a most significant portion (DH) and a trying to fix it, whether at a design level or at a production 

least significant portion (DL). A final three ALU/registers level of reprogramming a part to shunt a defective section's 

(A, B, C) are used as counters and contribute to the pro- 65 operation with that of a spare section. Two issues arise from 

duction of various PROGRAM CONTROL FLAGS 25 that such thinking. The first is paring the forty-eight bits down to 

assist with program control and branching on completion of the actual number of bits (say, thirty-two, or perhaps sixteen) 
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to be applied to the DUT. We shall shortly briefly mention 
how the paring down is done, and it is largely a matter of 
taking this many bits from X, that many from Y and the rest 
from Z. But not entirely, and this is the second issue, because 
certain addresses might lie within circuitry that is a left -for- 5 
right (or left-for-right and top-for-bottom) mirror image of 
another section of circuitry. This has the effect of rearranging 
what the bits mean, as far as what sequential address values 
are in physical order within that circuitry. This chip layout 
property may occur many times, and it may well be the case 
that how one Group of bits for, say, Y, are interpreted, may 
depend upon the accompanying value of some other, say, Z 
bits. The address mapper 29 is provided to allow the raw X, 
Y and Z addresses to be "repackaged," as it were, to reflect 
this sort of thing for the benefit of those who would test 
memories having such internal architectural arrangements. 15 
As to how its actually done, the Address Mapper 29 is 
constructed of a fairly large number of interconnected mul- 
tiplexers. It cannot implement the completely arbitrary look- 
up table behavior of a fully populated memory decode 
scheme as was temporarily assumed above for purposes of 20 
explanation. It can however, rearrange sub-fields of the X, Y 
and Z address components as needed, particularly since 
there is yet another mechanism that will do the paring down 
from forty-eight bits to the actual number needed. The 
Address Mapper 29 also contains three sixteen bit (address) 2 5 
look-up tables that allow it to perform limited arbitrary 
mapping within local ranges. 

The mapped address output 30 of the Address Mapper 29 
is applied as an address to a Buffer Memory 31 and to an 
Error Catch RAM 32, which, while having separate 30 
functions, may nevertheless be implemented as selectable 
partitions in the four Memory Sets that are collectively the 
Interior Test Memory 87. The mapped address output 30 is 
also applied as one input to an Addr. Bit Select circuit 37, 
whose multiplexing function is described in due course. The 35 
Interior Test Memory can be configured to contain many 
instances of various RAM's used for different functions. 
This is accomplished by declaring that certain portions of 
the different Memory Sets are to be used for the associated 
purposes. What is shown in FIG. 2 is one such arrangement; 40 
arrangements can be changed as testing proceeds, and this 
whole business of Memory Set usage should be considered 
to be very dynamic. None of the inhabitants of the Interior 
Test Memory (e.g., the error Catch RAM 32) are permanent 
hardware fixtures. What is permanent are the four Memory 45 
Sets. But which part of which Memory Set is an Error Catch 
RAM at any given time (if indeed there is even one defined) 
is dependent on whatever configuration has been estab- 
lished. 

Consider the Buffer Memory 31. Its function is to retain 50 
data patterns 33 and addresses 34 that can be applied to the 
DUT These are actual separate outputs from the Buffer 
Memory 31, although the Buffer Memory 31 is not a "dual 
port memory," but is preferably composed of portions of two 
different Memory Sets. In keeping with this, it is preferred 55 
that Stored Data 33 is kept in one Memory Set, while Stored 
Addresses 34 are kept in another. Also, we have not shown 
an explicit mechanism for writing to the Buffer Memory 31. 
One way that may be accomplished is by an addressed bus 
operation initiated by a Test Site Controller 4 at the behest 60 
of the program it is executing. There is an "under the 
floorboards," as it were, "utility services" bus called the 
Ring Bus 85 that goes to just about everything in FIG. 2 
(most of the visitations of which are not shown — as that 
would clutter the drawing immensely). Another and faster 65 
way of writing information to the Memory Sets is described 
in connection with FIG. 3. 



,844 Bl 

16 

The Error Catch RAM 32 is addressed by the same 
address that is applied to the Buffer Memory 31, and it either 
stores or retrieves information about errors, which opera- 
tions are performed in conjunction with a Post Decode 
Circuit, to be discussed later. As with the paths 33 and 34 
from the Buffer Memory 31, paths 61 (into the Error Catch 
RAM) and 62 (from the Error Catch RAM) are preferably 
MUXed outputs from a portion of a Memory Set (declared 
to be the current Error Catch RAM 32), in accordance with 
configuration information distributed by the Ring Bus (not 
shown). 

Note that the Data MUX 35 has as inputs the STORED 
DATA output 33 from the Buffer Memory 31 as well as data 
28 from the registers DH and DL in the collection 24 of 
ALU's. The Data MUX 35 selects which of these inputs (28, 
32) to present as its output 38, which is then applied as one 
of two vector components to a Transmit Vector Mapper/ 
Serializer/Receive Vector Compare Data Circuit 40 (the 
other component is the output 39 of the Addr. Bit Select 
circuit 37). Data MUX 35 performs this selection in accor- 
dance with values 36 stored in PGM SRAM 20. 

Circuit 40 can perform three vector related functions: 
assemble vector components (38, 39) into an ordered logical 
representation an entire vector that is to be applied 
(transmitted) to the DUT; apply an arbitrary dynamic cor- 
respondence (mapping) between the ordered bits of the 
logical representation of the transmit vector and the actual 
physical channel number of the Pin Electronics (i.e., which 
probe tip) will contact the DUT on behalf of that signal (i.e., 
that bit in the vector); and, cooperate with the compiler in the 
division of an entire logical vector into pieces to be applied 
separately and in order (serialization) for DUT's that admit 
of such a thing. Which of these functions is performed is 
determined by control signals from an SRAM 41, which is 
also addressed in accordance with a field in the two hundred 
and eight bit instruction fetched by the Micro-Controller 
Sequencer 19. 

Also contained within circuit 40 is a section of DUT 
Disable Logic 90, Its purpose is to respond to various 
conditions, some static, some contingent on test outcomes, 
but all defined program matically, that indicate which one or 
more DUT's, among as many as four thereof, are to be 
disabled. These indications are carried by four signals DD 
0:3 44b (DUT Disable for DUT Zero, for DUT One, etc.) 
This is in support of multi-DUT testing on a Test Site, and 
is further explained in due course. The output of Circuit 40 
is an up to sixty-four bit vector 44a that, along with the DUT 
Disable signals 446, is applied to a Vector FIFO 45, which 
when full generates the signal VEC_FIFO__FULL 26, 
whose meaning and use was discussed above. The vector at 
the top of the Vector FIFO 45 is removed therefrom upon 
receipt of a signal VEC_FIFO_UNLOAD 47 that origi- 
nates at a Period Generator 49 (to be discussed shortly). 
Such removed vectors (46) are applied to a Timing/ 
Formatting & Comparison circuit 52 that is connected to the 
DUT via the associated instance of Pin Electronics 9. That 
is, each instance (among the various Test Sites) of Pin 
Electronics 9 receives Transmitted & Received Vectors 7 
and Pin Electronics configuration information 8 from its 
associated Timing/Formatting & Comparison circuit 52. 

The Timing/Formatting & Comparison circuit 52 is 
coupled to the VT Bus 89 to receive configuration and 
control information. It will be recalled that the Timing/ 
Formatting & Comparison circuit 52 is actually eight IC's, 
which for our purposes we are treating as a single entity. 

The Timing/Formatting & Comparison circuit 52 has an 
Internal SRAM 54 addressed by the same Instruction 
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Address ("A" in the small circle) as is the Program SRAM 
20 of the Micro-Controller Sequencer 19. (An External 
DRAM 53 may be used in place of the Internal SRAM 54, 
but is locally addressed by an incremented counter that is not 
shown.) The Internal SRAM 54 (or external DRAM 53) 
assists in the production of Drive and Comparison cycles, 
which have associated formats. Drive cycles apply a trans- 
mit vector to the DUT using a pre-selected format supplied 
by one of RAM's 54 or 53. Comparison cycles receive a 
vector presented by the DUT and examine it, also according 
to a pre-selected RAM-supplied format, to determine if it 
matches previously supplied comparison data. Both Drive 
and Comparison cycles are adjustable as to their duration, 
and appropriately adjustable as to whether and when a load 
is applied, when data is latched or strobed, if a signal is 
Return-To-Zero or not, whether to surround a driven signal 
with its complement, etc. (These options are the various 
formats mentioned above.) 

The comparison produced by the Timing/Formatting & 
Comparison circuit 52 includes information, on a per chan- 
nel basis, about whether a channel failed because a logical 
value was wrong (a functional error) and/or because its 
electrical properties are outside acceptable limits (a para- 
metric error). Furthermore, and as will be explained in due 
course, when multiple DUT testing is performed it is known 
which channels are associated with which DUT's. This 
allows the production of the four signals DFE 0:3 (DUT # 
Functional Error) 103 and the four signals DPE 0:3 (DUT # 
Parametric Error) 104. The use to which these eight signals 
is put will be described later. 

The comparison performed by the Timing/Formatting & 
Comparison circuit 52 also produces a sixty-four bit value 
56 that is applied to a Receive Vector Reverse Mapper/ 
Deserializer 57, whose function may be considered to be the 
logical inverse of circuit 40. (The operation of circuit 57 is 
controlled by an SRAM 58 that corresponds to the control of 
circuit 40 by SRAM 41.) In turn, the output 59 of circuit 57 
is applied to the Post Decode circuit 60. At present, it is 
sufficient to say that the Post Decode circuit 60 can inspect 
via programmatic criteria both incoming error information 
59 and (previously) stored error information 60 (stored in 
Error Catch RAM) to produce condensed and more readily 
interpretable error information which may then by stored 
back into the Error Catch RAM 32 via path 61. An example 
would be to create a count of how many times there was an 
error within a particular range of addresses, which informa- 
tion may be useful in deciding when to attempt to engage in 
on-chip repair by enabling substitute circuits. 

We turn now to the Period Generator 49 and its associated 
Timing SRAM 51. These respond to an eight bit signal 
T_SEL 43 that, for each two hundred and eight bit instruc- 
tion fetched by the Micro-Controller Sequencer 19, deter- 
mines a duration for the associated operation of the Timing/ 
Formatting & Comparison circuit 52. T_SEL43 is member 
of the Various Control Values & Instructions 42 that are 
represented by the different fields within the fetched instruc- 
tion. As an eight bit value it can represent or encode two 
hundred and fifty-six different things. In this case those 
"things" are twenty-eight bit values stored in the Timing 
SRAM 51 and that are addressed by T__SEL. Each 
addressed twenty-eight bit value (23) specifies a desired 
duration with a 19.5 picosecond resolution. The sequence of 
accessed twenty-eight bit duration values (23) is stored in a 
Period FIFO 50 so that the individual members of that 
sequence will be retrieved and applied in synchronism with 
the retrieval of their intended corresponding vector, which is 
stored in the Vector FIFO 45. 
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A coarse timing value field in the oldest entry in the FIFO 
50 conveys duration information with a resolution of 5 nsec, 
and produces therefrom a signal VEC_FIFO„UNLOAD 47 
that transfers the next transmit vector from the Vector FIFO 

5 45 to the Timing/Formatting & Comparison circuit 52. A 
companion signal TIMING REMAINDER 48 is also applied 
to circuit 52. It is there that the ultimate resolution to 19.5 
picoseconds is accomplished. 

Refer now to FIG. 3, which is a simplified block diagram 

jo 64 of the Interior Test Memory 87 in the block diagram of 
FIG. 2. It receives a forty-eight bit mapped address 30 from 
the Address Mapper 29, which is applied to various Address 
Classifiers 77, 78 and 79. The Address Classifiers are 
associated with Memory Sets 73-76, which are each com- 

35 plete memory mechanisms that can individually perform 
various functions, such as being an ECR 32. Two of these 
Memory Sets (73, 74) are of external DRAM, while two are 
of internal SRAM. The two external DRAM Memory Sets 
will always have the same Address Classifier function in 

20 effect, and thus share one common Address Classifier 77. 
The internal SRAM Memory Sets 75 and 76 each have their 
own associated Address Classifiers, 78 and 79, respectively. 
These Address Classifiers can either pass an address through 
unchanged, or modify it in ways to be described in some 

25 detail in due course below. 

Each Memory Set includes a Memory Set Controller; the 
external DRAM Memory Sets 73 and 74 have DRAM 
Memory Set Controllers 65 and 66, respectively, while the 
internal SRAM Memory Sets 75 and 76 have respective 

30 SRAM Memory Set Controllers 67 and 68. During the 
testing of a DUT the address for memory transactions 
directed to any of these Memory Sets arrives at the associ- 
ated Memory Set Controller from the respectively associated 
Address Classifier. During the testing of a DUT Error Data 

35 61 arriving from the Post Decode circuit 60 and that is to be 
written into an ECR is first applied to Data Classifiers 
80-83, one of which is associated with each Memory Set. 
The function of the Data Classifiers will be described in due 
course below. They may or may not change the data applied 

40 to them, depending upon how they are configured and the 
function they are to perform. The Address and Data Clas- 
sifiers represent high speed paths for addresses and data, 
respectively, which are intended to operate at the highest 
speeds necessary. We shall shortly see that the Ring Bus (not 

45 yet shown) provides another way to convey addresses and 
data to the Memory Sets. 

At this point we have four Memory Set Controllers 
(65-68) that each have incoming (classified) addresses and 
(classified) data. Each of these Memory Set Controllers is 

50 coupled to an associated memory: DRAM Memory Set 
Controllers 73 and 74 are respectively coupled to external 
DRAM's 69 and 70, while SRAM Memory Set Controllers 
75 and 76 are respectively coupled to internal SRAM's 71 
and 72. These arrangements constitute the four Memory Sets 

55 73-76, two of which (75, 76) have modest amounts of high 
speed SRAM, and two of which (73, 74) have large amounts 
of slower DRAM. What is of interest to us at present is how 
the DRAM Memory Sets can be made as fast as the SRAM 
Memory Sets, as well as how to incorporate certain alter- 

60 natives concerning configuration of the DRAM, depending 
upon user preference and test program strategy. Thus, it is 
going to turn out that the DRAM Memory Set Controllers 65 
and 66 are configurable, perform different types of memory 
transactions, and are not altogether the same as the simpler 

65 SRAM Memory Set Controllers 67 and 68. For the sake of 
brevity, FIG. 3 does not show the structure that provides this 
flexibility; for now let's just say that each Memory Set 
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Controller is connected to the Ring Bus (not yet shown), 
from which it is instructed in the particular mode of opera- 
tion and configuration that are desired. Some of these modes 
involve how data is stored, and some have to do with getting 
it back out again. To conclude, then, note that each Memory 5 
Set does have an associated Data Out (62A-D) which is sent 
to the Post Decode Mechanism 60 for further processing. 
Note also that the data outputs from Memory Sets zero and 
two are applied to a MUX 84 whose output becomes 
STORED DATA 33 that is sent to Data MUX 35. Similarly, 
the data outputs from Memory Sets one and three are applied 
to a MUX 127 whose output becomes STORED 
ADDRESSES that are sent to the Addr. Bit Select MUX 37. 
The reason for having MUX's 84 and 86, and the details of 
how they are controlled will be discussed in due course J5 
below. 

Memory Set Two 75 receives a Bad Block Mode signal 
105 that is not received by any other Memory Set. The 
Controller 67 for Memory Set Two uses this signal to 
support a special mode of operation to be described in due 20 
course. What should be understood at this point is that when 
Bad Block Mode signal 105 is TRUE, and it is not the case 
that a write memory cycle is being initiated, then the 
presentation of a new address will cause an automatic read 
cycle to occur at that address. (A special Bad Block table is 25 
being read, and how the data is snagged and put to use is a 
subject for latter discussion.) 

Now consider FIG. 4, which is a block diagram 91 of a 
section of DUT Disable Logic located within circuit 40. It 
has three MUX's 92, 93 and 94 that function as switches to 30 
enable various conditions to disable any or all of the one to 
four DUT's being tested in a multi-DUT fashion. Our names 
for these four DUT's are DUTO, DUT1, DUT2 and DUT3. 
Our immediate goal is to produce corresponding DUT 
Disable signals DD0:3 (98a-d, which are also 44£>). There is 35 
a three-bit register 95, called the DUT Disable Selector 
Register, whose outputs are individually coupled to the 
respective control inputs of the three MUX's 92, 93 and 94. 
What each MUX does is select between a collection of four 
zeros (disable no DUT's) and a collection of four signals that 40 
do indicate a DUT to be disabled. Since there are three 
MUX J s, there are three kinds of conditions that disable any 
of the four DUT's. Any or all of the three conditions may be 
enabled at any given instant in time, depending upon which 
bits are set in register 95. 45 

The easiest condition to appreciated is associated with 
Bad DUT Register 96, a four-bit register coupled to the Ring 
Bus and settable by the test program. Each of its four bits 
corresponds to a DUT, and if that bit is set, and MUX 93 is 
set to select Bad DUT Register 96 instead of its four zeros, 50 
then that bit (and perhaps not just one, either) is passed 
through the MUX 93 and OR gates 97a-d to become the 
associated DUT Disable signal. Think of this condition as a 
way to deliberately disable a DUT 

The next condition is pretty easy to appreciate, once it is 55 
understood that during multi-DUT testing the test program 
writer is compelled to associate four of the PROGRAM 
CONTROL FLAGS 25 with four respective DUT's. (These 
four flags cannot be used for other purposes during multi- 
DUT testing, as they might during single DUT testing. This 60 
is thought to not be much of a burden. ) These four flags are 
named A^, B min , C min and DH mt „. They are associated with 
the four ALU's A, B, C and DH of circuit 24 in FIG. 2. These 
ALU's are sophisticated, and have output-value registers 
that can be pre-set, the ALU's can be told to decrement on 65 
command to do so, and a lower limit register associated with 
each ALU can be pre-set. A comparison mechanism asso- 
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ciated with each ALU produces the flags A^, B minl C min 
and DH^ when the limit for that ALU is reached. Thus, by 
agreeing to use the A ALU for, say, DUTO, a variable number 
of events of interest can be registered by decrements to ALU 
A, which will, after a threshold of tolerance is exhausted (the 
limit reached) set the flag A^ and, by action of block 
diagram 91 in FIG. 4, indicate that DUTO is to be disabled. 

It will be appreciated that each of the ALU's in circuit 24 
of FIG. 2 is a sixteen-bit ALU. This cooperates nicely with 
multi-DUT testing, since we assume that each of the DUT's 
in a multi-DUT set-up is identical, that even a single DUT 
in non multi-DUT testing is thirty-two bits or less, so that the 
worst multi-DUT case is sixteen bits (two six-bit parts to 
occupy thirty- two bits). To be sure, exactly what these 
ALU's get used for is up to the programmer who writes the 
test program. It is a fairly easy guess to assume that a four- 
or eight-bit ALU is of very limited use while testing a 
sixteen-bit part. Likewise, it is perhaps not an absolutely 
sure thing that a sixteen-bit ALU is sufficient when used as, 
say a counter, but it very probably is, and there has to be a 
limit, somewhere. 

The third condition is perhaps more involved, although it 
still boils down to four signals that correspond respectively 
to the (as many as) four DUT's being tested in the multi- 
DUT mode. In this case the four signals are the four least 
significant bits of some word in a table in memory. The 
addressed entry in the table originates with the address being 
used to create the transmit vector (notice we didn't say they 
have the same address, although they could), and the data in 
that word has been worked on to give it special significance 
with something called Data Classification. Leaving out the 
how, those four least significant bits can be fixed up to 
represent the act of addressing of a bad block within a 
particular DUT, which DUT can then be disabled. (The 
different bits within the four LSB's represent the different 
DUT's, and different addressed words within the table 
represent the different blocks with a DUT.) This mechanism 
can be used to keep the other DUT's enabled and just shut 
down the DUT having the bad block while the bad block is 
being addressed, to avoid increased execution time and 
unnecessary drive cycles in the test program by executing to 
the limits of the loop indices. (For some devices the lifetime 
of the part is limited by the number such cycles that are 
possible, so we don't want to use them up needlessly.) The 
effect is to disable only the bad block rather than treat the 
entire DUT as in need of disabling all the time. The 
underlying idea is that the bad block in that DUT might be 
repairable, and that we need to continue testing for the other 
blocks in that DUT until we know for sure that the DUT is 
too far gone to be repaired. 

The signal BAD BLOCK MODE 105 is TRUE whenever 
MUX 94 is used to enable the DUT-disble-by-bad-block- 
table technique described above. It is used by a particular 
Memory Set designated at the factory to be the one that is to 
cooperate by holding the bad block table. This association 
could be user selectable, but so far not one has found a good 
reason to make it so. Also, we refer that the Memory Set 
involved be an SRAM Memory Set. What BAD BLOCK 
MODE signal 105 does is to tell the Memory Set Controller 
to automatically read when a new address 30 is presented. 
This is done to make table entries in the bad block table 
(which will be in Memory Set Two, and indexable by a 
previously arranged address classification paradigm) auto- 
matically available at the same rate as the test program needs 
them. Various examples and applications for the techniques 
of Address Classification and Data Classification are set out 
in the incorporated disclosure entitled MEMORY TESTER 
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HAS MEMORY SETS CONFIGURABLE FOR USE AS contents ate set by traffic on the I VT Bus 112. The contents 

ERROR CATCH RAM, TAG RAM's, BUFFER MEMO- of register 123 determine just what comparisons are made to 

RIES AND STIMULUS LOG RAM. form the digital pass/fail indication for the channel as a 

To this point we have described how to indicate which rcsuh of the current receive vector c y cle - Tbat si 8 Dal '* the 

DUT's are to be disabled, assuming we can decide to do so 5 familiar COMPARE error signal 56, of which signal 125 is 

programmatically (and we do so assume). We have not yet ° n l? ut f of ™? b u S1 6 nals seDt . t0 C1 . rcuu " 10 FlG - 

shown how a DUT is actually disabled. It is to tbat task that f.T 1 * f ™ ™1 ^ ^ItI^ m \ t TH 

we now turn indicated by FORMAT 140 and DATA 139, each of which 

_ , _ is coupled to the Formatted Receiver 124. As with the Drive 

Refer now to FIG. 5, which is a simplified block diagram Formats Gene rator, the Formatted Receiver recognizes four 

^?™L W ^ C ^ m °f n ?o£? 10n ° f ^ V M ™ G ' of *e eight encodings, ignores drive formats, and does not 

FORMATTING & COMPARISON circwt 52 of FIG. 2. form , £m palisoa * nle * i( „, a receive format on lines 

What block d.agram 106 descnbes is how various errors are 140 ^ receive formats ma , are implemented m compare 

detected and subsequently related to individual DUT's dur- ^ n0 load> mmpm ^ load (load value may be s d . 

mg multi-DUT testing, and how individual DUT's are fied per channel)> between Hig h and Low, and do 

disabled Most of what is depicted in FIG. 5 is on a per not are receive (comparison) formats ^ be 

channel basis, which is to say '/« of the actual amount, with appreciated „ conventional by those skilled in the art, and 

the further understanding that in an actual implementation are believ6d to ao explanation. 

th f 6 . h SiXt ^? U L inSt , an , CeS , are r 055 ei8 , ht ^ hiP f '- ^ To continue, then, the Formatted Receiver 124 produces 

of the eight chips (which we never show as individual items , si j FUNCTIONAL ERROR 126 that means there was 

is coupled to the VT Bus 89, and replicates it as an internal M „ rJS,,,. ,„ „ „ . „ . , , •„„, „ „„, ,. 

• .u e ii j.L nVr r. iit it . i \rr -a \ a failure to compare at the level of logical representation, 

version thereof called the IVT Bus 112 (Internal VT Bus). ,_j „ • „, n/nAuPTDir' cddod i« .u . 

_ c. -.u- .u • « • -i iD ° a signal PARAMETRIC ERROR 127 that means a 

Vlu TV TZZr ™ h C CirC f U mr? T-! P arametric measurement exceeded associated limits. These 

o that of the Ring Bus in the APO portion of FIG. 2; it is s[ h m d ^b\o, regardless of the content of F/P 

the under-the-floorboards utility communicattons mecha- ^ £rror ^ Registef m J* ^ meanmg Qf ^ s{gnal 

D1Sra ' . COMPARE ERROR 125. The sense of the signal FUNC- 

To continue, block diagram 106 includes a Drive Formats TIONAL ERROR 126 that denotes an error is captured in a 

Generator 107, whose job it is to take three bits of FORMAT C H FE Latch 128 (Channel Functional Error Latch). CH PE 

140 and one bit of DATA 139, all associated with a particular 129 does the same for the signal PARAMETRIC 

channel, and vanable on a per vector basis, and create the 3Q ERROR 127. These latches impart "stickiness" to their 

drive signal actually applied to a pin or pad 109 of a DUT. respective meanings, in that an error indication in a channel 

There are four drive formats, and they are NR (Non Return), for one veclor (here a logic one) ^ persist despite the 

RZ (Return to Zero), RTO (Return to One) and SBC absence of actual error conditions for subsequent vectors, 

(Surround By Complement). These drive formats will be until such time ^ the latch is reset> ^ signals m and 

appreciated as conventional by those skilled in the art, and 35 127 can ^t, but not clear, their respective latches 12S and 

are believed to require no further explanation. When accom- 129 . However, CH FE Latch 128 and CH PE Latch 129 are 

pamed by the DATA value for the channel in the vector, the coupled to the IVT Bus 112, so that they may be reset when 

indicated FORMAT determines much of what is applied to desired . Besides being reset by actioQ of the IVT BuSf they 

the pin 109. The signal that leaves the Drive Formats may ^ be read t0 discover what condition produced the 

Generator is applied to a level shifter (actually part of the Pin ^ failure to compare 

Electronics 9) after which it (121) is connected to the pad In singk DlJT operatioQ the tQree . bit F0RMAr descrip „ 

109 via a probe. tor ^ its acC ompanying one-bit value for DATA are 

There are also four receive formats, making a total of supplied by the fNT, SRAM 54, addressed by the circled A 

eight formats. The Drive Formats Generator 107 gets a three originating in the Micro-Controller Sequencer 19. There is 

bit signal 140 that indicates which one of the eight formats 45 0 ne such set of signals for each channel. Thus, we see that 

is in effect at any instant in time. It also gets the data signal INT.SRAM 54 functions as an local extension of PGM. 

139. The Drive Formats Generator 107 ignores encodings SRAM 20 in the Micro-Controller Sequencer 19. This 

that are not dnve formats, and does not drive if it does not capability allows the test program to specify for every vector 

receive a drive format. its 0 wn FORMAT/DATA at each bit position. If there were 

Also connected to pin or pad 109 via conductor 121 are 50 no multi-DUT testing or the possibility of bad blocks in a 

Receive Comparators 117 and 118 (which may also be part single DUT, this would be sufficient, and is essentially a 

of the Pin Electronics 9). Each Receive Comparator is also conventional arrangement. 

coupled to a corresponding threshold voltage, CH VOH and However, as previously mentioned, we desired to test 

CH VOL (Channel Voltage Out High and Channel Voltage multiple DUT's at the same time and on the same Test Site, 

Out Low). The Receive Comparators are coupled to Receive 55 and we wish to cope more effectively with bad blocks in 

Latches 119, whose combined values for any given receive memory devices that have blocks. To that end, we seek 

cycle can indicate High, Low or Between. alternative FORMATS and DATA that can be selectively and 

A Parametric Measurement Unit 120 is coupled to con- conditionally invoked according to the turn of events during 

ductor 121, as well. It performs measurements of analog testing. The invocation of such alternate formats is fre- 

parameters, such the current flowing through conductor 121. 60 quently referred to as "jamming." An alternate FORMAT is 

The output of the Parametric Measurement circuit 120 and stored in a Jam Format Register 111, and corresponding 

the content of the Receive Latches 119 are applied to a alternate DATA is stored in a Jam Data Register 113. Both 

Formatted Receiver 124, whose function is to create the a of these registers get their content from the IVT Bus 112. 

digital indication of the desired comparison for this channel Furthermore, which source of FORMAT (INT.SRAM 54 or 

within the current vector's receive cycle. To this end, the 65 register 111) is used is determined by MUX 110, in accor- 

Formatted Receiver circuit 124 is further coupled to the dance with instructions received from a Format & Data Jam 

contents of an F/P Error Select Register 123, and whose Control Logic circuit 115. MUX 114 does the same for 



03/19/2004, EAST version: 1.4.1 



US 6,671 

23 

DATA. The Format & Data Jam Control Logic circuit 115 
receives configuration and mode of operation instructions 
from a Jam Control Register 116, also coupled to the IVT 
Bus 112. It is switching one or both of the MUX's 110 and 
114 to their Jam Register positions that, at bottom, allows a 5 
DUT to be disabled, or its results faked as being good. Of 
course, the exact results of jamming depend on the alternate 
FORMAT and/or DATA that are jammed in place of the 
primary FORMAT and DATA. 

Circuit 115 also receives signal 141, which is a sticky 10 
version of the signal FUNCTIONAL ERROR 126, a signal 
142 meaning that a DUT selected for being Disabled affects 
this channel, a signal 143 that means that a DUT affecting 
this channel has been determined to exhibit a functional 
error, and, a similar signal 144 that means that a DUT 15 
affecting this channel has been determined to exhibit a 
parametric error. In all cases, the DUT associated with 
signals 142-144 are the same DUT, as determined by a DUT 

# Register 132 whose content (a value indicating one of 
DUTO, DUT1, DUT2 or DUT3) is set by the IVT Bus. That 20 
content (value identifying a DUT) is applied as a control 
input 136 to MUX 133, MUX 134, MUX 135 and to two 
other circuits not yet mentioned: the Functional DUT # 
Decoder 131 and Parametric DUT # Decoder 130. 

The DUT # Register, of which there is one per channel, is 2 s 
not set to indicate which DUT a test program is interested in. 
It is used to indicate which DUT of multiple DUT's this 
channel is associated with, and typically is set at the begin- 
ning and left alone. Thus, the totality of the DUT# Registers 
establishes the correspondence between the DUT's of multi- 30 
DUT testing and the channels used to test those DUT's. 

To conclude our initial description of block diagram 106, 
the signals 44b from logic circuit 90 in circuit 40 of FIG. 2 
are applied to MUX 133. These are the DUT Disable DD0:3 
signals described in connection with FIG. 4, and stand for 35 
certain ways (although not all of the ways) of deciding that 
a DUT is to be disabled (as were described in connection 
with FIG. 4). So, suppose DUT1 were to be disabled, as 
indicated by DDI at MUX 133. For each instance of block 
diagram 106 that is associated with DUT1 the signal DDI 40 
would pass through MUX 133 to become signal 142. The 
import here is that logic circuit 115 (i.e., that instance thereof 
for such a channel) would be told that this channel 
(whichever it is) is to have some form of alternate format or 
data invoked, instead of the primary format thereof that is 45 
used for initial testing until the discovery that selected 
(programmatically determined) error criteria have been met. 
What "some form" means is determined by the content in the 
associated instance of register 116, and we shall describe the 
range of possibilities shortly below. To reiterate, the sixty- 50 
four DUT # Registers 132 are the core of the channel-to- 
DUT correspondence mechanism, and should not be thought 
of as identifying a DUT, but as including or excluding this 
channel in some action on behalf of a DUT it is associated 
with. One level we may speak of "disabling a DUT" while 55 
on another we recognize that at the lowest level we only ever 
do something different for those channels that represent the 
DUT. And we might not do whatever that thing is to all of 
the channels, and we might do different things to different 
channels. 60 

Now consider Parametric DUT # Decoder 130. It receives 
the latched and sticky PARAMETRIC ERROR and the DUT 

# signal 136 from register 132. What decoder 130 does is 
map its input onto a corresponding line representing the 
DUT, and that is then wired OR'ed with that same function 65 
for each of the other channels. This wire OR'ing produces 
the DUT Parametric Error signals 104 (DPE0, DPE1, DPE2 
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and DPE3), which are sent out from circuit 52 to the Next 
Address Calculator 102 in Micro-Controller Sequencer 19 of 
FIG. 2, and are also applied to MUX 134, so that the 
appropriate one thereof can become signal 143. A similar 
arrangement obtains for Functional DUT # Decoder 131, 
and the DUT Functional Error signals 103. 

What block diagram 106 does with regard to controlling 
MUX's 110 and 114 is determined by the Format & Data 
Jam Control Logic circuit 115. Here is a description of its 
functional attributes: 

(A) Signal 142 goes true because of a decision taken by 
the test program, or any other event that would cause 
one of DUT Disable signals 44b. That means that this 
channel is associated with a DUT that is to be disabled, 
and therefore, that the alternate FORMAT and/or DATA 
definitions are to be invoked. 

(B) This channel is explicitly to be disabled. This is 
indicated by a bit in Jam Control Register 116. The 
result is that the alternate FORMAT and/or DATA 
definitions are to be invoked. 

(C) Invoke the alternate FORMAT and/or DATA defini- 
tions for this channel if this channel is associated with 
a DUT that has been determined to exhibit a DUT 
Functional Error (a DFE signal becomes signal 144). 

(D) Invoke the alternate FORMAT and/or DATA defini- 
tions for this channel if this channel has been deter- 
mined to exhibit a channel Functional Error (signal 
141). This is what can be used to prevent overprogram- 
ming. 

(E) Invoke the alternate FORMAT and/or DATA defini- 
tions for this channel if this channel is associated with 
a DUT that has been determined to exhibit a DUT 
Parametric Error (a DPE signal becomes signal 143). 

(F) (C), (D) and (E) can be selected to occur continuously 
until the associated error occurs, or begin continuously 
afterwards. 

These attributes of circuit 115 appear complex; actually 
they are simply more numerous than convoluted, and it will 
be appreciated that their implementation is essentially the 
reduction of the binary input signals to a logic equation that 
can then be synthesized by any of several well known 
techniques. 

It will be appreciated that we have described a mechanism 
for invoking alternate FORMAT'S and DATA, on a vector by 
vector basis, and for each channel within a DUT. The 
FORMAT includes both drive and compare components, 
and a number of useful features are obtained by proper 
selection of these alternates. For example, a channel can be 
made to appear good, even when it is not, by jamming a "no 
compare" as the alternate FORMAT The alternate FOR- 
MAT can specify a drive format in place of a compare 
format, with the result that no comparison is performed and 
no error is generated. (We assume, of course, that the 
programmer knows the circumstances in which this is 
useful.) Such drive removal is also good for preventing 
overprogramming by automatically jamming, on a pin/ 
channel basis, the data sent to the DUT once that bit appears 
successfully programed. 

Refer now to FIG. 6, which is a block diagram 145 of 
some logic circuitry located within the Next Address Cal- 
culator 102 of the Micro-Controller Sequencer 19 of FIG. 2. 
This circuitry is there to augment the branching on DUT 
error condition described in the incorporated Application 
entitled METHOD AND APPARATUS FOR 
NO-LATENCY CONDITIONAL BRANCHING. What the 
circuitry does, in accordance with a DUT ERROR SELECT 
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FIELD 146 in the 208-bit instruction word from PGM. 
SRAM 20, and the application of that field to MUX 147, is 
select either a specified one or the OR of any of the DUT 
Functional Error signals 103 to become signal 149 YDFE 
(Yes DUT Functional Error). A similar selection from DUT 5 
Parametric Error signals 104 by MUX 148 produces signal 
150 YDPE (Yes DUT Parametric Error). It is the signals 149 
and 150 that are used to effect DUT error related branching 
in the Micro-Controller Sequencer's test program. These 
signals (149 and 150) are what are respectively referred to 10 
as "a program functional error flag" and a "program para- 
metric error flag" in the incorporated disclosure mentioned 
earlier in this paragraph. 

Consider the following segment of pseudo code for a test 
program, wherein the line numbers in parentheses are for is 
reference only: 

(1) format DUT0 drive_pattern0, format DUT1 drive_ 
patternl, format DUT2 drive__pattern2, format DUT3 
drive_pattern3; 

(2) format DUT0 check__patternO, format DUT1 checks 20 
patternl, format DUT2 check_pattem2, format DUT3 
check__pattern3; 

(3) errsel-0, jump (ferr) DUT0_ERROR; 

(4) errsel-1, jump (ferr) DUH_ERROR; 25 

(5) errsel=2, jump (ferr) DUT2_ERROR; 

(6) errsel-3, jump (ferr) DUT3_ERROR; 

This segment of code would be nested within a loop that 
supplies an address to the DUT that changes once per 
iteration of the loop. If there are no error the jumps at lines 30 
(3)-(6) will not occur. However, suppose that there is an 
error associated with DUT1. Then line four will cause a 
jump to the routine named DUTl_ERROR. This happens 
because the compiler of the test program arranged for the 
micro code corresponding to line (4) of the pseudo code 35 
segment to put into the DUT ERROR SELECT field (146 of 
FIG. 6) a value that causes MUX's 147 and 148 to select 
signals DFE1 and DPE1 to become signals YDFE (149) and 
YDPE (150), respectively. Different locations in the micro 
code will have different values for field 146. Thus, for line 
(6) signals DFE3 and DPE3 are selected. 40 

The result of the foregoing disclosure to this point is a 
powerful combination of programmable and configurable 
mechanisms that will autonomously disable specific 
channels, or those channels related to one DUT from among 
many, as is needed to support multi-DUT operation and 4 5 
certain other useful features related to the testing of non- 
volatile memories, without requiring multi-threaded pro- 
gramming. 

One particular feature deserves further attention. Refer 
now to FIG. 7, wherein is depicted various example tables 50 
in memory that may be used, in conjunction with subject 
matter set out above, to facilitate the testing of memory 
DUT's that have an internal block structure. Suppose that we 
are to test a DUT that has a block structure, and further that 
it is only an eight-bit device, so we are emboldened to test 55 
four of them at a time. The desire is to be able to selectively 
disable the channels that are associated with a block/DUT 
combination when we have discovered that it is bad. We 
want to do this to remove multiple thread considerations, 
and perhaps also to keep from unnecessarily banging into 
loop indices and issue more drive cycles to the DUT than are 60 
needed. 

Consider two tables in the SRAM Memory Sets (these 
tables are not so big as to need the DRAM Memory Sets, and 
we prefer the SRAM to the DRAM for other reasons that are 
not pertinent here). In particular, establish a WORKING 65 
RESULTS Tag RAM 154 in Memory Set Three 76. Use the 
narrow word feature to set the effective word width to four 
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bits, and assume the bit position/DUT correspondence to be 
the LSB in the table is DUT0, and the MSB is DUT3. 
Different addresses in the table represent the different blocks 
in the DUT's. Pre-load the table, and another one 155 just 
like it in Memory Set 2 (75), with all ones (all blocks good, 
no failures). 

Now begin testing, which we assume takes the form of a 
series of loops, perhaps with each instance of the loop 
stepping through the blocks to perform a different type of 
test. Each instance of the loop can decide that a block is bad 
(although the possibility of repair is not being ruled out by 
anything said here). Using well known Address Classifica- 
tion and Data Classification techniques, table 154 operates 
as a Tag RAM to register bad blocks indexed by table 
address, with DUT's indexed by bit position. Do a first pass 
through the loop. During this time the four bits of table 155 
will be read out each time there is a new address to Memory 
Set Two (remember the magic of the signal BAD BLOCK 
MODE 105) and will be applied to MUX 94 of FIG. 4, 
which also will have been enabled to pass its input through 
to become DUT Disable signals 44b. However, since the 
table 155 has been pre-loaded with all ones, no failures are 
indicated, and no DUTs will be disabled during this pass of 
the loop. 

Now assume that this first pass through the loop found an 
error in block 1 of DUT3; that spot (158) in table 154 would 
now have a zero. Before beginning the next instance of the 
loop we arrange for table 154 to be copied into table 155; 
observe that there is a zero in location 157 of table 155. The 
copying of working table 154 into accumulated table 155 is 
performed in a mode that preserves zeros already in a 
location (makes them "sticky"), and protects them from 
being overwritten with ones. Now re-initialize table 154 and 
start the next instance of the loop. During that next instance 
of the loop block one of DUT3 will have its channels 
disabled, or reported as "good" regardless of the actual facts 
(depending on how the block diagram 106 of FIG. 6 has 
been configured and the nature of the particular jamming 
performed). This process is continued until all testing has 
been accomplished. Along the way we may surmise that 
block three of DUT1 had an error, and that a loop in progress 
(or perhaps just finished) discovered trouble in block one of 
DUT0 (cell 156 in table 154) and that this will soon appear 
in cell 159 of table 155. 

Now, there is one additional aspect of this kind of testing 
that should be pointed out. The process of copying table 154 
to table 155 can be performed utilizing the counter resources 
in the Post Decode circuit 60. This particular trick is, in and 
of itself, conventional, so we won't spend any time on how 
it is done internally. But, it should be appreciated that those 
counters can be set to count down from some pre-set value, 
and not indicate an output for their associated quantity until 
the counter bottoms out. The net effect, as far as our example 
is concerned, is that it might take, say, n-many failures of a 
DUT/block combination occurring (over time) in the 
WORKING RESULTS Tag RAM (table 154) before a zero 
is written to the corresponding cell in the ACCUMULATED 
FAILURES buffer memory table 155. 

We claim: 

1. A method of simultaneously testing multiple DUT's 
using a single test site in a memory tester, the method 
comprising the steps of: 

(a) contacting the multiple DUT's with probes corre- 
sponding to channels within the single test site; 

(b) associating a collection of channels with each DUT in 
the multiplicity thereof; 

(c) defining for each channel a primary format for apply- 
ing transmit vectors and interpreting receive vectors, 
each channel's primary format for use initially and in 
the absence of detected errors for that channel, and also 
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an alternate format for applying transmit vectors and 
interpreting receive vectors, each channel's alternate 
format for use subsequent to there being detected 
selected error criteria for that channel; 

(d) generating with a test program a sequence of transmit 5 
vectors and expected receive vectors and applying the 
transmit vectors to the channels associated with all 
DUT's in the multiplicity thereof, each channel using 
the primary format associated therewith; 

(e) comparing a receive vector actually received from the 10 
DUT with an expected receive vector and determining 
that a channel is a non-comparing channel; 
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(f) pursuant to step (b) and in response to step (e), 
generating an indication that the DUT within the mul- 
tiplicity thereof, and with which the non-comparing 
channel is associated, has met selected error criteria and 
is to be disabled; and 

(g) subsequent to step (f), using for the non-comparing 
channel the alternate format associated with that non- 
comparing channel. 

2. A method as in claim 1 wherein subsequent to step (f) 
the alternate format is used for each channel associated with 
the DUT to be disabled. 
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ABSTRACT 



Functional circuits and cores of circuits are tested on inte- 
grated circuits using scan paths. Using parallel scan dis- 
tributor and collector circuits for these scan paths improves 
lest access of circuits and cores embedded within ICs and 
reduces the ICs power consumption during scan testing. A 
controller for the distributor and collector circuits includes a 
test control register, a test control state machine and a 
multiplexer. These test circuits can be connected in a hier- 
archy or in parallel. A conventional test access port or TAP 
can be modified to work with the disclosed test circuits. 

12 Claims, 20 Drawing Sheets 
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CONTROLLER FOR SCAN DISTRIBUTOR 
AND CONTROLLER ARCHITECTURE 

This application claims priority under 35 USC § 119(e) 
(1) of provisional application No. 60/074,264, filed Feb. 10, 
1998. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

This invention relates generally to testing of integrated 
circuits with scan paths and particularly relates to testing 
integrated circuits with parallel scan distributors and collec- 
tors controlled by a controller that includes a state machine. 

2. Description of the Related Art 

Cost effective testing of today's complex integrated cir- 
cuits is extremely important to semiconductor manufactur- 
ers from a profit and loss standpoint. The increases in 
complexity of state-of-the-art integrated circuits is being 
accompanied by an ever increasing difficulty to test the 
integrated circuits. New test techniques must be developed 
to offset this increasing integrated circuit test cost, otherwise 
further advancements in future integrated circuit technology 
may be blocked. One emerging technology that is going to 
accelerate the complexity of integrated circuits even more is 
intellectual property cores. These cores will provide highly 
complex pre-designed circuit functions such as; DSPs, 
CPUs, I/O peripherals, memories, and mixed signal A/D and 
D/A functions. These cores will exist in a library and can be 
selected and placed in an integrated circuit quickly to 
provide a complex circuit function. The low cost testing of 
integrated circuits that contain highly complex core func- 
tions will be a significant challenge 

SUMMARY OF THE INVENTION 

The disclosed circuits provide a description of a controller 
for use with the parallel scan distributor and collector 
circuits. The controller has a test control register, a test 
control state machine and a multiplexer. The controller also 
has inputs and outputs for connection to additional control- 
lers in a hierarchical or parallel arrangement. The controller 
is also programmable to provide different types of test 
control for testing different types of circuits. 

The disclosed parallel scan distributor and collector cir- 
cuits provide a low power method of scan testing combina- 
tional logic within an IC by allowing scan test communica- 
tion to occur over a larger number of shorter length scan 
paths. 

With a synchronizer and delay circuit, the disclosed test 
circuits can further reduce the power needed to test the 
integrated circuits. The test circuits disclosed can be used to 
test functional combinatorial logic, random access memory, 
and digital to analog and analog to digital circuitry. Con- 
ventional IEEE 1149.1 test access port or TAP circuits can 
be modified to operate with the disclosed scan distributor 
and collectors circuits and controllers. 

BRIEF DESCRIPTION OF THE VIEWS OF THE 
DRAWINGS 

FIG. 1 depicts an integrated circuit. 

FIG. 2 is a block diagram of a known parallel scan path 
test arrangement. 

FIG. 3 is a block diagram of a parallel scan path test 
arrangement according to the present invention. 

FIG. 4 is a block diagram of the scan path test arrange- 
ment of FIG. 3 further including a test controller according 
to the present invention. 



^8,093 Bl 
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FIG. 5 is a flow chart illustrating operation of the test 
controller and scan path arrangement of FIG. 4. 

FIG. 6 is a flow chart illustrating an alternate operation of 
the test controller and scan path arrangement. 
5 FIG. 7 depicts an integrated circuit that includes an 
embedded core. 

FIG. 8 is a block diagram of a scan test circuit and 
controller arrangement for testing the integrated circuit and 
1Q core of FIG. 7 according to the present invention. 

FIG. 9 depicts an integrated circuit including an embed- 
ded core, in which the embedded core itself includes an 
embedded core. 

FIG. 10 is a block diagram of a scan test circuit and 
15 controller arrangement for testing the integrated circuit and 
embedded cores of FIG. 9 according to the present inven- 
tion. 

FIG. 11 is a block diagram of a hierarchical connection 
between scan test circuit arrangements according to the 
20 present invention. 

FIG. 12 is a block diagram of an arrangement of scan test 
circuits and controllers using multiplexer circuitry according 
to the present invention. 
25 FIG. 13 depicts an integrated circuit. 

FIG. 14 is a block diagram of an arrangement of scan test 
circuits and controllers for the integrated circuit of FIG. 13. 

FIGS. ISA and 15B are block diagrams of a controller 
used in the scan test circuits. 
30 FIG. 16 is a flow chart of states used in the controller of 
FIGS. 15. 

FIG. 17 is a block diagram of controllers arranged in a 
hierarchy. 

35 FIG. 18 is a block diagram of controllers connected in a 
multiplexed arrangement. 

FIG. 19 is a block diagram of controllers arranged in 
parallel. 

FIGS. 20A and 20B are block diagrams of integrated 
40 circuits under test and test drivers and receivers. 

FIGS. 21A and 21B are block diagrams of scan path 
circuits and representations of capacitive loadings. 

FIGS. 22 A and 22B are, respectively, a block diagram of 
a serial connection of clock signals and a timing diagram of 
45 the clock signals occurring in series. 

FIG. 23A-1 is a block diagram of a random access 
memory device including parallel scan distributor and col- 
lector circuits. 

50 FIG. 23A-2 is a flow chart of states used to test a random 
access memory device. 

FIG. 23B-1 is a block diagram of a digital to analog 
converter including parallel scan distributor circuits. 

FIG. 23B-2 is a flow chart of states used to test a digital 
55 to analog converter. 

FIG. 23C-1 is a block diagram of an analog to digital 
converter including parallel scan collector circuits. 

FIG. 23C-2 is a flow chart of states used to test an analog 
to digital converter. 

FIG. 24 is a block diagram of an integrated circuit with 
mixed signal cores. 

FIG. 25 A is a block diagram of a conventional test access 
port. 

65 FIG. 25B is a block diagram of a modified test access port. 
FIG. 26 is a block diagram of a modified test access port 
and test controller joined together. 
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DETAILED DESCRIPTION 

In FIG. 1, integrated circuit 100 comprises a semicon- 
ductor substrate 102 with bond pads 104 and functional 
circuitry 106. To expedite testing, an integrated circuit's 
functional circuitry 106 can be arranged into many parallel 
scan paths, each scan path having a serial data input and 
serial data output. Having many short parallel scan paths, 
versus one long continuous scan path, is preferred since it 
reduces the time it takes to shift test data in and out. Each 
parallel scan path's serial data input and output can be 
connected to a bond pad 104 to allow a tester to input test 
data to and output test data from all scan paths concurrently. 
Parallel scan design references include FIG. 18-3 of Chapter 
18 of 1990 IEEE Publication "The Test Access Port and 
Boundary Scan Architecture" by Colin Maunder, and FIG. 
14a of U.S. Pat. No. 5,526,365 to Whetsel. 

In FIG, 2, known parallel scan path 200 has a serial data 
input at pad 202 and a serial data output at pad 204. Known 
parallel scan path N 206 has a serial data input at pad 208 
and a serial data output at pad 210. In the circuits of FIG. 2, 
N scan paths will require use of 2xN bond pads for serial 
data input and serial data output. While some bond pads will 
be used to supply control to the scan paths and for power and 
ground, a majority of the bond pads may be used for scan 
path serial data input and output. The number of available 
bond pad pairs will limit the number of scan paths that can 
be accessed in parallel. 

The scan cycle time of the conventional scan path 
arrangement of FIG. 2 can be expressed by (L+1)T, where 
L is the scan path length through which stimulus and 
response test patterns are shifted during each scan cycle, 1 
is the capture step required to input response data from the 
functional logic under test into the scan path, and T is the 
period of the scan clock. Using this equation, for example, 
the scan cycle time for a scan path having a length (L) of 
1000 bits is (1000+1)T, or 1001T. The test time equals "scan 
cycle time" times "the number of test patterns". 

In FIG. 3, scan test circuit 301 includes a scan distributor 
300, scan paths 324 through 342 and scan collector 344. 
Parallel scan distributor circuit 300 forms a data input 
amplification circuit connected between bond pad 302 and 
data inputs 304 through 322 to ten plural scan paths 324 
through 342, of which only the first and last are depicted for 
clarity of the drawing. Parallel scan collector circuit 344 
forms an output amplification circuit connected between the 
data outputs 346 through 364 of plural scan paths 324 
through 342 and bond pad 366. 

Scan test circuit 367 includes a scan distributor 368, scan 
paths 392 through 410 and scan collector 412. In a like 
manner, parallel scan distributor circuit 368 forms a data 
input amplification circuit connected between bond pad 370 
and data inputs 372 through 390 to ten plural scan paths 392 
through 410, of which only the first and last are depicted for 
clarity of the drawing. Parallel scan collector circuit 412 
forms an output amplification circuit connected between the 
data outputs 414 through 432 of plural scan paths 392 
through 410 and bond pad 434. 

Scan paths 324 through 342 form one group of scan paths 
connected between scan distributor circuit 300 and scan 
collector circuit 344. Scan paths 392 through 410 form 
another group of scan paths connected between scan dis- 
tributor circuit 368 and scan collector circuit 412. 

In FIG. 3, the parallel scan distributor provides a data 
input amplification circuit located between a bond pad and 
data inputs to plural scan paths. The parallel scan collector 
provides a data output amplification circuit located between 
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the data outputs of the plural scan paths and a bond pad. This 
is different from the conventional parallel scan path arrange- 
ment depicted in FIG. 2 in which each scan path's data input 
is directly connected to a bond pad and each scan path's data 

5 output is directly connected to a bond pad. Therefore, the 
data amplification capability of the present invention is 
understood by comparing FIG. 2 and FIG. 3. 

The conventional parallel scan path arrangement of FIG. 
2 thus is modified by the insertion of parallel scan distributor 

10 circuits and parallel scan collector circuits. The scan dis- 
tributor circuits 300, 368 are basically serial-input parallel- 
output shift registers, and the scan collector circuits 344, 434 
are basically parallel-input serial -output shift registers. 
While the parallel input and output width of the scan 

15 distributor and collector circuits can be of any bit width, the 
distributor and collector circuits 300, 344, 368 and 412 have 
10 bit wide parallel inputs and outputs that provide one bit 
input and output to the respective parallel scan paths. 
The scan input modifications of the FIG, 2 arrangement 

20 include; (1) disconnecting the bond pads from scan paths 
1-N, (2) inserting the scan distributor circuits, (3) connect- 
ing the bond pads to the serial inputs of the scan distributor 
circuits, and (4) connecting each parallel output of the scan 
distributor circuits to a respective input of the scan paths. 

25 The scan output modifications of the FIG. 2 arrangement 
include: (1) disconnecting the bond pads from scan paths 
1-N, (2) inserting the scan collector circuits, (3) connecting 
the bond pads to the serial outputs of the scan collector 
circuits, and (4) connecting the output of each scan path to 

30 a respective parallel input of the scan collector circuits. 
The scan path modifications of the FIG. 2 arrangement 
include: (1) dividing each scan path 1-N into a group of 
individual shorter length scan paths, each preferably being 

35 of equal length, and in which the number of individual scan 
paths of each group equals to the number of parallel inputs 
and outputs (10) of the scan distributor and scan collector 
circuits, (2) connecting the serial data input of each scan 
path of each group to a parallel output of a respective scan 

4Q distributor circuit, and (3) connecting the serial data output 
of each scan path of each group to a parallel input of a 
respective scan collector circuit. 

With 10 bit deep scan distributor and collector circuits, the 
number of individual scan paths in each group is equal to 

45 ten. If the scan paths 200 and 206 of FIG. 2 were each 1000 
bits long, the above partitioning would convert each 1000 bit 
scan path into a group of ten 100 bit scan paths. 

In FIG. 4, integrated circuit 446 includes scan test circuits 
448. One scan distributor 450 and scan collector 452 pair 

50 provide access to 10 parallel scan paths 454 through 472. 
Each of the 10 parallel scan paths connects to combinational 
logic 474 in functional circuitry 106. The combinational 
logic 474 is tested by inputting test stimulus and outputting 
test response through the parallel scan paths 454 through 

55 472. While stimulus input and response output connections 
are shown only between combinational logic 474 and par- 
allel scan path 1 454, all ten of the parallel scan paths 454 
through 472, respectively, are similarly connected to com- 
binational logic 474. 

60 A controller 476 connects to the scan distributor circuit 
450, parallel scan paths 1-10 454 through 472 and scan 
collector 452, as well as all other scan distributors, parallel 
scan paths, and scan collectors in the integrated circuit by 
leads 482. Controller 476 controls the test operation of the 

65 scan distributor circuits, parallel scan paths 1-10 454 
through 472 and scan collector 452, as well as all other scan 
distributors, parallel scan paths, and scan collectors in the 
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integrated circuit. The controller 476 connects to bond pads 
478 and 480 for access and control by a source external to 
the integrated circuit, such as a wafer or integrated circuit 
tester. 

When the integrated circuit's functional circuitry is con- 
figured for testing, all functional registers (flip/flops or 
latches) in the integrated circuit are converted into scan 
registers that form the parallel scan paths shown. Also, 
during test configuration, all combinational logic in the 
integrated circuit that was associated with the functional 
registers remains associated with the scan registers after the 
conversion. This conversion of an integrated circuit's func- 
tional circuitry into scan paths and combinational logic is 
well known. 

The combinational logic 474 is tested by receiving test 
stimulus data from the parallel scan paths 454 through 472 
and outputting test response data to the parallel scan paths 
454 through 472. The test stimulus data applied to the 
combinational logic 474 from the parallel scan paths is input 
to the parallel scan paths via the scan distributor 450. The 
test response data received into the parallel scan paths from 
the combinational logic is output from the parallel scan 
paths via the scan collector 452. During test, the controller 
476 operates the scan distributor 450, parallel scan paths 
454-472, and scan collector 452 to test the combinational 
logic 474. Simultaneous with this test, the controller 476 
also operates other scan distributors, parallel scan paths, and 
scan collectors of the integrated circuit to test further com- 
binational logic within the integrated circuit. 

In FIG. 5, the flow chart illustrates one example of the 
controller operating the scan distributor, parallel scan paths, 
and scan collector of FIG. 4 during testing of the integrated 
circuit's combinational logic. Initially, the controller will be 
in the start test state waiting for a signal to start testing. In 
response to a start test signal, the controller executes the 
following steps. The step numbers correspond to the state 
numbers in the diagram of FIG. 5. 



Step Number 


Operation 


501 


Test to see if start test has occurred. 




No, goto 501. 




Yes, goto 502. 


502 


Configure functional circuitry into test 




mode, goto 503 


503 


Capture response data outputs from all 




parallel scan paths (PSPs) into scan 




collector (PSC), goto 504 


504 


Shift scan distributor and collector ten 




times to load stimulus data into distributor 




and unload response data from collector, goto 




505 


505 


Shift scan paths one time to load scan paths 




with test stimulus data from scan 




distributor, goto 506 


506 


Test to see if parallel scan paths (PSPs) 




have filled with the test stimulus pattern 




No, goto 503 




Yes, goto 507 


507 


Test to see if end of test has occurred 




No, goto 508 




Yes, goto 509 


508 


Capture response pattern from combinational 




logic into parallel scan paths (PSPs), goto 




503 


509 


End of test, configure IC circuitry into 




normal mode, goto 501 



Following the end of test step 507, the test is complete and 
the controller configures the functional circuitry back into its 
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normal mode, then goes to and remains in the start test state 
501 until another start test signal occurs. During the test, a 
tester supplies stimulus data to the scan paths via the serial 
to parallel operation of the scan distributors, and receives 

5 response data from the scan paths via the parallel to serial 
operation of the scan collectors. The tester compares the 
response data it receives from the scan collectors to expected 
response data to determine if the test passes or fails. 
Alternately, during test the tester may compress the response 

10 data it receives from the scan collectors into signatures and 
then compare the signatures at the end of test to expected 
signatures. 

In FIG, 6, an example of another controller flow chart 
illustrates how the decision states 506 and 507 of FIG. 5 may 

15 be merged into state 605 of FIG. 6 to streamline the test 
execution flow. In FIG. 6, state 605 executes the shift 
operation that moves data from the scan distributors into the 
scan paths, then executes decision states to determine 
whether the next state will be state 503, 508, or 509. 

20 Merging the decision states into state 605 is possible because 
the decisions regarding the full/not full status of the scan 
paths and the end of test are easily predictable conditions. 



25 



30 



45 



Step Number 


Operation 


501 


Test to see if start test has occurred 




No, goto 501 




Yes, goto 502 


502 


Configure IC circuitry into test mode, goto 




503 


503 


Capture response data outputs from all 




parallel scan paths into scan collectors, 




goto 504 


504 


Shift scan distributors & scan collectors 




ten times to load stimulus data into scan 




distributors and unload response data from 




scan collectors, goto 605 


605 


Shift scan paths one time to load scan paths 




with test stimulus data from scan 




distributors, then 




If scan path is not filled, goto 503 




If scan path is filled & not end of test, 




goto 508 




If scan path is filled & end of test goto 




509 


508 


Capture response pattern from combinational 




logic into scan paths, goto 503 


509 


Configure IC circuitry into normal mode, goto 




501 



While the test data input and output bandwidth of the scan 
paths 454 through 472 is reduced by the serial to parallel 

50 translation in scan distributor 450 and parallel to serial 
translation in scan collector 452 that occurs for each datum 
shifted into and out of the parallel scan paths. The overall 
test time however is comparable to the conventional parallel 
scan test times for the circuits of FIG. 2. The reason for this 

55 is that scan distributor and scan collector circuits enable test 
data to be communicated to a larger number of shorter length 
parallel scan paths, whereas the conventional parallel scan 
arrangement of FIG. 2 communicates test data to a lesser 
number of longer length scan paths. 

60 The scan cycle time of the scan distributor and scan 
collector arrangement of FIG. 4, using the FIG. 6 controller 
operation steps, can be expressed by equation ((D+2)L+1)T, 
where: (D+2) is the scan depth (D) of the scan distributor 
and scan collector circuits shifted, step 504; plus 2, the 

65 shifting of data between scan distributor and scan paths in 
step 605, and between scan collector and scan paths in step 
503; L is the scan path length through which data is shifted 
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during each scan cycle; plus 1, the capture step 508 required architectures. The view is simplified in that it depicts only 

to input data from the combinational logic into the scan one exemplary pair of scan distributor and scan collector 

paths; and T is the period of the scan clock. circuits for each of the functional and core circuits. 

For the purpose of illustrating a comparison of the scan i n p IG ^ functional scan test circuits 801 associate with 

cycle times between the conventional path arrangement of 5 functional circuits 702. Parallel scan distributor circuit 800 

FIG. 2 and the scan distributor and scan collector scan path forms a data • t amplification circuit connected between 

arrangement of FIG. 4, the L in the scan distributor and scan bond d 8Q2 and data { 8Q4 m tQ teQ lufal 

collector scan cycle time equation above can be expressed in ^ ^ ^ m fa g42 of whicfa ^ ^ fifs( ^ ^ 

terms of the L in the conventional scan cycle time equation. j j c 1 *7 r iL j * n « 1 n 

* * 1 j u j • j t Kir* 1 «• 1 are depicted for clarity of the drawing. Parallel scan collec- 

As previously described in regard to FIG. 3, a conventional in t . r . t OAA c 3 4 4 * t . 

r 4U . • i *L/n° u fl nnn „„l„A „ 10 tor circuit 844 forms an output amplification circuit con- 
scan path having a length (L) can be converted into a group ^ o >•/»!. u c 1 1 

<■ * a- a 1 *u u u « i„„~«u nfi /m nected between the data outputs 846 through 864 of plural 

of ten individual scan paths each having a length of L/10, *u v*a *u u oa*» a u a a qcc t> a a 

when using 10 bit scan distributor and scan collector cir- fj£ D P a <^ * 24 throu S h f 42 an ? ^ P* d 86 f ■ *° nd P ads 

cuits. Converting the original conventional scan path of FIG. f 2 and f 66 are P** of P eri P heral bond P ads 706 of the 

2 into an equivalent scan distributor and scan collector scan Pactional circuits 7UZ. 

path arrangement does not modify the stimulus and response A controller 876 connects to the scan distributor circuit 
connections to the combinational logic, it simply partitions 80 °, parallel scan paths 1-10 824 through 842 and scan 
the single conventional scan paths into an equivalent group collector 844, by leads 882. Controller 876 controls the test 
of shorter length scan paths. Therefore, for the purpose of operation of the scan distributor circuit 800, parallel scan 
comparing scan cycle times between the conventional scan paths 1-10 824 through 842 and scan collector 844. The 
path arrangement of FIG. 2 and a converted, but equivalent, controller 876 connects to bond pads 878 and 880 for access 
stimulus and response connection, scan distributor and scan and control by a source external to the integrated circuit 700, 
collector scan path arrangement of FIG, 4, L/10 is substi- such as a wafer or integrated circuit tester. Bond pads 878 
tuted for L in the scan distributor and scan collector scan and 880 are part of peripheral bond pads 706. 
cycle time equation above. 25 In core circuits 704, core scan test circuits 901 associate 
This results in a scan distributor and scan collector scan with core circuits 704. Parallel scan distributor circuit 900 
cycle time equation of: ((D+2)(L/10)+1)T, or ((10+2)(L/ forms a data input amplification circuit connected between 
10)+1)T, or (L2L+1)T, where: L equals the bit length of the terminal 902 and data inputs 904 through 922 to ten plural 
original scan path of FIG. 2, and D equals the depth (i.e. 10 scan paths 924 through 942, of which only the first and last 
bits) of the scan distributor and scan collector circuits. 30 are depicted for clarity of the drawing. Parallel scan collec- 
Substituting L-1000 into the conventional scan path tor circuit 944 forms an output amplification circuit con- 
equation, (L+1)T, of FIG. 2 and scan distributor and scan nected between the data outputs 946 through 964 of plural 
collector equation, (1.2L+1)T, above, results in 1001T and scan paths 924 through 942 and terminal 966. Terminals 902 
1 201 T, respectively. In comparing 1001T to 1201T, it is seen and 966 are part of core circuit terminals 708 of the core 
that the conversion of the conventional scan path arrange- 35 circuits 704. 

ment into an equivalent scan distributor and scan collector A controller 976 connects to the scan distributor circuit 

scan path arrangement only extends the scan cycle time by 900, parallel scan paths 1-10 924 through 942 and scan 

approximately 16.6%, in this example. collector 944, by leads 982. Controller 976 controls the test 

The scan distributor and scan collector scan cycle time operation of the scan distributor circuit 900, parallel scan 

advantageously approaches the conventional scan test time 40 P aths 1-10 924 through 942 and scan collector 944. The 

as the depth of the scan distributor and scan collector circuits controller 976 connects to terminals 978 and 980 for access 

increase, since test data may be communicated to a larger and control by controller 876 over leads 984 and 986. 

number of shorter length parallel scan paths. For example, Terminals 978 and 980 are part of core circuit terminals 708. 

with 40 bit deep scan distributor and scan collector circuits Scan distributor 800 has a serial output on lead 884 

connected to forty 25 bit scan paths, converted from the FIG. 45 connecting to one input of multiplexer 886. The other input 

2 scan path as described above, the scan distributor and scan of multiplexer 886 receives a signal FI. The sole output of 

collector scan cycle time becomes (40+2)(L/40)+l)T, or multiplexer 886 connects on lead 888 to terminal 902. 

(1.05L+1)T, which extends the scan cycle time by approxi- Terminal 966 connects to the sole input of demultiplexer 

mately 4.7% compared to the conventional scan cycle time. 890. One output of demultiplexer 890 on led 892 connects 

For identical combinational logic being tested, the number 50 to a serial input of scan collector 844. The other output of 

of scan cycles required to apply the test patterns is the same demultiplexer 890 connects to a signal FO. Controller 876 

for both the scan distributor and scan collector and conven- connects to the multiplexer 886 on lead 894 and connects to 

tional scan path arrangements. The integrated circuit test the demultiplexer 890 on lead 896. 

time will therefore be extended in proportion to the scan i n t h e integrated circuit 700, the scan distributor 800 and 

cycle time extension. 55 scan collector 844 circuits are associated with the I/O bond 

In FIG. 7, an integrated circuit 700 contains within its pads for the integrated circuit 700. In the core 704, the scan 

functional circuitry 702 a complex core circuit 704, such a distributor 900 and scan collector 944 circuits are associated 

DSP. The integrated circuit's functional circuit 702 contains with the I/O terminals for the core circuits 704. The scan 

other circuits besides the core. Integrated circuit 700 distributor 800 and scan collector 844 circuits are the same 

includes peripheral bond pads 706 and core circuit 704 60 as described in regard to FIG. 4, except that the scan 

includes its own set of peripheral terminals 708. In this distributor circuit 800 has been provided with a serial output 

example, both the integrated circuit 700 and core 704 have 884 and the scan collector 844 circuit has been provide with 

been designed to include the previously described invention a serial input 892. The core's scan distributor 900 and scan 

comprising scan distributor and scan collector circuits, par- collector 944 circuits are the same as scan distributor 800 

allel scan paths, and the controller 476. 65 and scan collector 844 circuits with two exceptions: they are 

In FIG. 8, the integrated circuit 700 includes functional associated with the core's terminals 902 and 966 and they 

circuit and core circuit scan distributor and scan collector have no serial output 884 or serial input 892. 
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A multiplexer 886, or other type of connection circuit, is the core's scan distributor circuit 900, and the serial data 

provided at each core terminal that has a scan distributor, output from the core's scan collector circuit 944 is input to 

and a demultiplexer 890, or other type of connection circuit, the integrated circuit's scan collector circuit 944. Further, 

is provided at each core terminal that has a scan collector. the integrated circuit controller 876 disables the non-core 

The multiplexer allows inputting either a functional input 5 scan paths from shifting and capturing data and causes the 

signal or test input to the core terminal. The demultiplexer scan distributor 800 and scan collector circuits 844 to 

allows outputting either a functional output signal or test operate as test data pipeline registers between the integrated 

output from the core terminal. circuit pads 802 and 866 and the core's scan distributor 900 

The test input to the multiplexer 886 comes from the serial and scan collector 944. During test, the core's scan distribu- 

output of the integrated circuit's scan distributor circuit 800, 10 tor '00 and scan collector 944 circuits are controlled by the 

and the test output from the demultiplexer 890 goes to the core's controller 976 to operate as described in regard to 

serial input of the integrated circuit's scan collector circuit FIGS. 5 or 6, The only difference is that the depth of the scan 

844. The functional input and output, FI and FO, are data input to and output from the core's scan distributor 900 

connected to neighboring circuits within the integrated cir- and scan collector 944 circuits is greater since the data is 

cuit. During normal mode, the integrated circuit's controller 15 pipelined though the integrated circuit's scan distributor 800 

876 controls the multiplexers and demultiplexers at the core and scim collector 844 circuits. 

terminals to input and output the functional signals. In test When both the integrated circuit's non-core and core 

mode, the controller 876 controls the multiplexers and circuitry are being tested, both the integrated circuit and core 

demultiplexers to input and output test signals. controllers 876 and 976 are enabled. Also the core terminal 

Controller 976 is not directly connected to the peripheral 20 multiplexer 886 and demultiplexer 890 are set to input test 

bond pads 878 and 880 as is controller 876. Rather, con- data 10 the core's scan distributor 900 from the integrated 

troller 976 for the core circuits is connected indirectly to the circuit's scan distributor 800, and to outpuUest data from the 

peripheral bond pads via the controller 876. Controller 876 core's scan collector 944 to the integrated circuit's scan 

has authority over the core's controller 976 in that controller collector 844. During test, both controllers 876 and 976 are 

876 can enable, disable or modify the operation modes of 25 synchronized to the external control input from the tester via 

controller 976. However, during test the controllers may the peripheral bond pads to allow stimulus data to be 

operate together to synchronize the operation of the scan scanned into both the integrated circuit and core scan 

distributor and scan collector circuits of the integrated distributor circuits while response data is scanned out from 

circuit and core. both the integrated circuit and core scan collector circuits. 

As will be seen in embodiments to be described, this 30 The test operation of the integrated circuit and core scan 

controller interconnect technique is maintained between distributor and scan collector circuits is identical to that 

controllers that are arranged hierarchically within an inte- previously described in regard to FIGS. 5 or 6. Again, the 

grated circuit. Also, the authority of a higher level controller only difference is that the depth of the scan data input and 

over a lower level controller is maintained in controllers ^ scan data output is greater since the integrated circuit and 

arranged within a hierarchy. Further maintained is the ability core scan distributor and scan collector circuits are serially 

of hierarchical controllers to synchronize themselves during connected. The advantage of testing both the integrated 

test so that the operation of all hierarchically linked scan circuit's non-core and core circuitry at the same time as that 

distributor and scan collector circuits, associated with the it reduces the test time of the integrated circuit, 

controllers, occur synchronously. 4Q These three modes of testing can be setup by scanning the 

Testing, using the integrated circuit and core scan dis- integrated circuit and core controllers. Referring to FIG. 8, 
tributor and scan collector circuits of FIG. 8, is the same as the integrated circuit controller is connected to integrated 
previously described for the circuits of FIG. 4 with two circuit pads for input and output and the core controller is 
exceptions. The serial data input to the core's scan distribu- connected to the integrated circuit controller for input and 
tor circuit 900 passes through the integrated circuit's scan 45 output. A tester that is connected to the integrated circuit 
distributor circuit 800 and the serial data output from the controller input/output bond pads 706 can scan the control- 
core's scan collector circuit 944 passes through the inte- lers to set up the type of test to be performed. After setting 
grated circuit's scan collector circuit 844. Three types of up the test type, the tester can input control on input pads to 
testing can occur on the integrated circuit 700: (1) testing of cause the controllers to operate according to the way the 
the integrated circuit's functional non-core circuitry, (2) 50 controllers have been set up. While the integrated circuit 700 
testing of the core circuitry, and (3) simultaneous testing of has one core, other integrated circuits may contain multiple 
both the integrated circuit's non-core circuitry and the core cores. Multiple cores can be tested either individually or in 
circuitry. combination with other cores and non-core circuits. 

When the integrated circuit's non-core circuitry is being In FIG. 9, integrated circuit 1000 contains functional 

tested, but the core is not being tested, the core's controller 55 circuitry 1002, which contains first core circuitry 1004. First 

976 is disabled by the integrated circuit's controller 876 and core circuitry 1004 contains second core circuitry 1006. This 

the multiplexer 886 and demultiplexer 890 disconnect the hierarchical embedding of core circuitry or cores within 

core's terminals from inputting or outputting functional cores creates a very difficult testing situation. The present 

signals to neighboring integrated circuit circuitry. In this invention however renders such nesting of cores testable 

mode the core is quiet and its I/O is disabled from disturbing go reg^dless of how deeply embedded they might be within an 

testing being performed on the non-core circuitry. integrated circuit. 

When the core is being tested, but the non-core circuitry Functional circuitry 1002 is associated with bond pads 

is not being tested, the core's controller 976 is enabled by the 1008. First core circuitry is associated with terminals 1010. 

integrated circuit's controller 876. The integrated circuit's Second core circuitry is associated with terminals 1012. 

controller 876 controls the core terminal multiplexer 886 65 In FIG. 10, the scan distributor and scan collector archi- 

and demultiplexer 890 such that the serial data output from lecture is shown hierarchically extending from the integrated 

the integrated circuit's scan distributor circuit 800 is input to circuit level to the first core level, and from the first core 
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level into the second core level. Integrated circuit 1000 core circuits 1214 that include scan test circuits 1216 and 

comprises functional scan test circuits 1014 associated with core circuits 1218 that include scan test circuits 1220. 

functional circuitry 1002, first scan test circuits 1016 asso- Controller 1222 is associated with scan test circuits 1216 

ciated with first core circuits 1004 and second scan test and controller 1224 is associated with scan test circuits 

circuits 1018 associated with second core circuits 1006. 5 1220. 

In accordance with the circuits depicted in FIGS. 7 and 8, Multiplexer circuitry 1226 connects scan test circuits 
test access to the second scan test circuits 1018 is achieved 1202 to scan test circuits 1216 and 1220. A serial output 
through the serial pipelines provided by the first scan test 1228 of scan distributor 1230 connects to the multiplexer 
circuits 1016 and functional scan test circuits 1014. Thus the 1226 and a serial input 1232 of scan collector 1234 connects 
scan distributor 1020 and scan collector 1022 circuits of 10 to multiplexer 1226. Scan test circuits 1216 connect to 
second core circuits 1006 is achieved via the serial pipelines multiplexer 1226 through multiplexer 1236, which also 
provided by the scan distributor and scan collector circuits receives a functional input FI, and through demultiplexer 
1024 and 1026 of first scan test circuits 1016 and the scan 1238, which also provides a functional output FO. Scan test 
distributor and scan collector circuits 1028 and 1030 of the circuits 1220 connect to multiplexer 1226 through multi- 
functional scan test circuits 1014. 35 plexer 1240, which also receives a functional input FI, and 

Also as described in regard to FIG. 8, all the functional through demultiplexer 1242, which also provides a func- 

circuits 1002, first core circuits 1004 and second core tional output FO. Controllers 1222 and 1224 also connect to 

circuits 1006 can be tested together, in selected multiplexer 1226 through respective leads 1244, 1246, 1248 

combinations, or individually. When testing all of the inte- aQ d 1250. 

grated circuit's circuitry together, the scan distributor and 20 Integrated circuit 1200 provides an alternate configuration 

scan collector circuits and controllers are set up to allow the for using scan distributor and scan collector circuits whereby 

tester to input deep stimulus patterns to serially connected costs 1214 and 1218 are individually selected and connected 

scan distributors and to output deep response patterns from to the integrated circuit's scan distributor and scan collector 

serially connected scan collectors. The test is the same as circuitry and controller for testing. This selection is achieved 

described in connection with FIG. 8, only the depth of the 25 by placing multiplexer circuitry 1226 between the integrated 

serial stimulus and response patterns changes as more scan circuit's scan distributor 1230, scan collector 1234, and 

distributor and scan collector circuits are added to the controller 1208 circuitry, and the cores. Thus the cores 1214 

integrated circuit's bond pad input and output scan opera- and 1218 can be individually connected to the serial data 

tions. input and output of the integrated circuit's scan distributor 

In FIG. 11, integrated circuit 1100 includes peripheral 30 and scan collector circuitry and to the integrated circuit's 

bond pads 1102, functional circuits 1104 and scan test controller. The integrated circuit's controller supplies the 

circuits 1106, 1108, 1110 and 1112. Scan test circuits 1106, control input to the multiplexer circuitry for selecting a core 

1108, 1110 and 1112 are connected in series to each of bond for testing. Once a core is selected and connected to the 

pads 1114 and 1116. integrated circuit's scan distributor and scan collector 

The scan test circuits 1106, 1108, 1U0 and 1112 illustrate 35 circuitry, the core is tested as previously described, 

a simplified view of how scan distributor and scan collector It is important to note that when the integrated circuits 

circuits can be used hierarchically within an integrated 446, 700, 1000, 1100 or 1200 evolve into a core for use 

circuit to bring about massive parallel scan testing. Each inside another integrated circuit, their hierarchical scan 

available pair of integrated circuit bond pads can be viewed 40 distributor and scan collector test architectures are reusable 

as entry and exit points to a hierarchical arrangement of inside that integrated circuit. The ability to reuse the test 

embedded scan distributor and scan collector circuits. Each architecture, as well as the test patterns developed for the 

scan distributor and scan collector circuit can be serially architecture, is an important feature of the present invention, 

linked to the bond pads, either directly, as with the scan This feature prevents having to spend design resources and 

distributor and scan collector circuits 1118 and 1120, or via 45 time redesigning the core's test architecture each time the 

intermediate scan distributor and scan collector circuits, core is used inside a new integrated circuit. A core's scan 

such as scan distributor and scan collector circuits 1122 and distributor and scan collector test architecture can be viewed 

1124, or 1126 and 1128. as plug and play as far as its reuse within an integrated 

In FIG. 11, 4 levels of 10 bit scan distributor and scan circuit, 
collector circuits are linked to the bond pad pair 1114, 1116 50 In FIG. 13, integrated circuit 1300 contains non-core 
to provide a 40 bit wide test data input and output interface circuitry 1301, core 1 1302 and core 2 1304 that contain the 
using only two of the integrated circuit bond pads. Each disclosed scan distributor and scan collector test architec- 
level could represent the hierarchical position of an embed- ture. Integrated circuit 1300 has bond pads 1306, core 1 
ded core within the integrated circuit. While not shown, all 1302 has terminals 1308 and core 2 1304 has terminals 
available pad pairs (i.e. pads not used for test control or 55 1310. If each of the cores have a number of terminals that 
power/ground) can be similarly connected in a hierarchical consume most of the pads on the integrated circuit, they will 
arrangement to 40 bit wide scan distributor and scan col- have to be individually selected and tested using the multi- 
lector circuits inside the integrated circuit. A tester con- plexing approach described in regard to FIG. 12. However, 
nected to the pad pairs can transfer test data to the target test if the cores have a small number of terminals relative to the 
circuits residing in the integrated circuit at each hierarchical 60 number of integrated circuit pads then parallel or simulta- 
circuit level 1-4. The serial to parallel and parallel to serial neous testing of the cores is possible as described in FIG. 14 
test data operation of hierarchically arranged scan distribu- below. 

tors and scan collectors is clear from FIG. 11, i n FIG. 14, non-core circuitry 1301, core 1 1302, and core 

In FIG. 12, integrated circuit 1200 includes scan test 2 1304 of the integrated circuit 1300 of FIG. 13 each contain 

circuits 1202 connected to bond pads 1204 and 1206. 65 integrated circuit pad connections to their scan distributor 

Controller 1208 connects to bond pads 1210 and 1212 and and scan collector architectures for parallel testing. This is 

scan test circuits 1202. Integrated circuit 1200 also includes possible because the number of terminals required to gain 
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access to the scan distributor and scan collector architectures (SDO). The SDO signal is output from the controller either 

associated with the non-core circuitry, core 1, and core 2 is by an integrated circuit pad or core terminal, as seen in FIG. 

less than or equal to the number of available integrated 8. 

circuit pads. The test terminals 1402, 1404 for the scan The state machine responds to the TPI, TCI and TEI 

distributor and scan collector architecture for the non-core 5 inputs to: (1) control the operation of the test control register 

circuits, as well as the test terminals 1406, 1408 and 1410, via the control output from the state machine, and (2) control 

1412 for the scan distributor and scan collector architecture the operation of the external scan distributor, scan collector, 

of core 1 and core 2 can all be coupled to integrated circuit and scan path circuits via the SHDC, CPC, SHPSP, and 

pads of integrated circuit 1300. CPPSP outputs from the state machine. The control input to 

For simplification only one pair of scan distributor and 10 the state machine from the test control register is used to 

scan collector circuits are shown in the non-core, core 1 and program the way the scan distributor, scan collector, and 

core 2 examples of FIG. 14. However, each example may scan path circuits are controlled using the SHDC, CPC, 

contain a plurality of scan distributor and scan collector SHPSP, and CPPSP signals. Also, the programming control 

circuit pairs coupling a plurality of grouped scan paths. Also input to the state machine can also modify the operation of 

in FIG. 14 it is seen that each controller within each scan is the SHDC, CPC, SHPSP, and CPPSP signals, and enable 

distributor and scan collector architecture is shown con- additional control output signals to allow other types of test 

nected to separate integrated circuit pads. Having separate control sequences to be performed, 

pads coupled to each architecture's scan distributor, scan State Machine Control of Test Control Register 

collector, and controller allows each architecture to be In FIG. 15B, the test control register 1502 contains a shift 

operated independently. For example, testing of the non-core 20 register 1510 and an update register 1512. The shift and 

circuitry, core 1, and core 2 could occur in response to a update registers are initialized by a reset (RST) control 

different control and data communication at the integrated output from the state machine. The shift register shifts data 

circuit pads coupled to the respective architectures. Thus from SDI to SDOl by a clock (CK) control output from the 

testing could occur at different times, have different state machine. The update register updates control data from 

durations, and/or use different clock rates. The cores 1 and 25 the shift register by an update (UPD) control output from the 

2 of FIG. 14 may contain embedded cores as shown in FIGS. state machine. The update register is used to prevent the 

9 and 10, each embedded core containing scan distributor control outputs of the test control register from changing as 

and scan distributor architectures and being testable as data is shifted through the shift register, and its use as such 

previously described. is well in the art. 

Controller Description 30 When TEI is low, the state machine is disabled to a known 

In FIG. 15 A a controller 1500 is an example of the state and outputs a low on RST to initialize the shift and 

controller used in the disclosed scan distributor and scan update registers of the test control register. When initialized, 

collector architecture. The controller 1500 consists of a test the test control register outputs control to the multiplexer to 

control register 1502, a test control state machine 1504, and connect SDOl to SDO. Also, following initialization, the 

a multiplexer 1506. The state machine 1504 has inputs for 35 test control register outputs control to the state machine to: 

receiving a test protocol input (TPI), a test clock input (TCI), (1) establish an initial operation mode for the state 

a test enable input (TEI), and control input from the test machine's SHDC, CPC, SHPSP, and CPPSP outputs, (2) 

control register. The TPI, TCI, and TEI signals are input to output control to the scan distributor, scan collector, and 

the controller either by integrated circuit pads, or core scan path circuits external of the controller to enable normal 

terminals, as seen in FIG. 8. 40 operation of the integrated circuit or core in which the 

The state machine 1504 has outputs for providing a shift controller resides, and (3) outputs a low on TEO to similarly 

distributor and collector output (SHDC), a capture collector disable and initialize any connected lower level controller, 

output (CPC), a shift parallel scan path output (SHPSP), and When TEI is high, the state machine is enabled to respond 

a capture parallel scan path output (CPPSP). The SHDC, to the TCI and TPI inputs to scan data through the test 

CPC, SHPSP, and CPPSP signals are output from the 45 control register from SDI to SDO, and to update and output 

controller to the scan distributor, scan collector, and scan control data from the test control register. It is important to 

path circuits, as shown in FIG. 8, and are used to control the note that when the state machine is first enabled to scan data 

scan distributor, scan collector, and scan path circuits during through the test control register, the scan path only includes 

test. Additional signals may be output from the state the test control register between the SDI path input and SDO 

machine as required to provide different types of control 50 output. The control data updated from the test control 

during test. The state machine also has control outputs which register following a scan operation may include control to 

are input to the test control register. condition the multiplexer and the TEO output to allow a scan 

The test control register 1502 has an input for receiving a path connected between the SDOl and SDI1 signals to be 

serial data input (SDI) and inputs for receiving control from added to the test control register scan path so that it may be 

the state machine. The SDI signal is input to the controller 55 included in subsequent test control register scan operations, 

either by an integrated circuit pad or core terminal, as seen A scan path existing between SDOl and SDI1 that has been 

in FIG. 8. The test control register has an output for added to the test control register scan path, may be deleted 

providing a serial data output 1 (SDOl, an output for from being scanned by conditioning the multiplexer and 

providing a test enable output (TEO) to a connected lower TEO output to disallow scan operations through the scan 

level controller, and a controller bus output that provides 60 path between SDOl and SDI1. U.S. Pat. No. 4,872,169 by 

control to the state machine and multiplexer within the Whetsel, entitled Hierarchical Scan Selection, describes a 

controller, and to the scan distributor, scan collector, and method of adjusting scan path lengths, 

scan path circuits of the test architecture, including test State Machine Control of Scan Distributors, Scan Collector, 

multiplexers and demultiplexers shown in FIGS. 8 and 12, and Scan Path Circuits 

external of the controller. The multiplexer inputs control and 65 In addition to responding to TPI and TCI input to operate 

SDOl from the test control register, and a serial data input test control register scan operations, the state machine 

1 (SDI1). The multiplexer outputs a serial data output responds to TPI and TCI input to operate the SHDC, CPC, 
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SHPSP, and CPPSP outputs to the scan distributor, scan 
collector, and scan path circuits. Prior to operating the 
SHDC, CPC, SHPSP, and CPPSP outputs, a scan operation 
to the test control register is performed. This scan operation 
establishes control input to the state machine to program the 
SHDC, CPC, SHPSP, and CPPSP outputs to operate in 
modes to support the types of testing previously described in 
regard to the non-hierarchical scan distributor and scan 
collector architecture of FIG. 4 and the hierarchical scan 
distributor and scan collector architecture of FIG. 8. This 
scan operation also establishes the type of test mode control 
which is output from the controller and input to the scan 
distributor, scan collector, scan path, and other associated 
test circuits, such as the multiplexer and demultiplexer 
circuits of FIGS. 8 and 12. 

If a non-hierarchical test operation is to be performed, i.e. 
only a single controller and its associated scan distributor 
and scan collector circuits are being setup for testing (FIG. 
4), a single test control register scan operation is all that is 
required prior to using the state machine to operate the 
SHDC, CPC, SHPSP, and CPPSP outputs. However, if a 
hierarchical test operation is to be performed, i.e. multiple 
levels of controllers and their associated scan distributor and 
scan collector circuits are being setup for testing (FIG. 8), 
multiple test control register scan operations are required, 
prior to using the state machine to operate the SHDC, 
CDPC, SHPSP, and CPPSP outputs, to allow the scan paths 
of the lower level controllers to be connected to the scan 
path of the highest level controller, as will be described later 
in regard to FIGS. 17, 18, and 19. 

In a non-hierarchical test operation, for example as 
described in FIG. 4, the state machine receives control from 
TPI and TCI to scan the test control register to setup the test 
to be performed. Following this scan operation, the state 
machine receives further control from TPI and TCI to 
operate the SHDC, CPC, SHPSP, and CPPSP outputs to 
control the scan distributor, scan collector, and scan path 
circuits during the test. To understand better the operation of 
the state machine, a state diagram is provided in FIG. 16. 
This state diagram accompanied by the following descrip- 
tion provides a description of how the state machine oper- 
ates. 

In FIG. 16, an example state diagram 1600 of one 
preferred implementation of the state machine is shown. 
This diagram illustrates how the state machine responds to 
the TPI and TCI inputs to transition into various states that 
enable scanning of the test control register and controlling of 
the scan distributor, scan collector, and scan path circuits. 
The TCI input to the state machine is the clock that times the 
operation of the state machine, whereas the TPI input to the 
state machine is the input that causes the state machine to 
transition between its states. 

Whenever the TEI input is low, the state machine goes to 
and remains in the reset (RESET) state 1602 and will not 
respond to the TPI input. In the RESET state, the state 
machine outputs control to initialize the test control register 
as previously described. When the TEI input is high, the 
state machine is enabled to respond to the TPI input. After 
TEI goes high, the state machine remains in the RESET state 
if TPI is high. The state machine transitions to and remains 
in the idle (IDLE) state 1604 in response to a low on TPI. In 
the IDLE state, the RST control input to the test control 
register (FIG. 15B) is set high to remove the rest condition 
on the shift and update registers. 

In response to a high and low input on TPI, the state 
machine transitions to the shift test control register state 
(SHIFT-TCR) 1606, via the select test control register state 
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(SELECT-TCR) 1608. In the SHIFT-TCR state, the state 
machine output control (CK'of FIG. 15B) to shift data 
through the test control register from SDI to SDO of FIG. 
15A. The state machine remains in the SHIFT-TCR state for 

5 the number of TCI inputs required to shift data into the test 
control register. When the shift operation is complete, a high 
on TPI transitions the state machine into the update test 
control register state (UPDATE-TCR) 1610, where the state 
machine outputs control (UPD of FIG. 15B) to cause the 

10 update register to load and output the data shifted into the 
shift register. 

From the UPDATE-TCR state 1610, the state machine is 
designed to either transition back to the IDLE state 1604 if 
TPI if low, or transition to the select test state (SELECT- 

15 TEST) 1612 if TPI is high. This two way next state decision 
was designed into the state machine to facilitate the inven- 
tion's ability to setup either hierarchical or non-hierarchical 
test architectures. For example, if the test setup is for a 
non-hierarchical test architecture (i.e. FIG. 4), the next state 

20 from UPDATE-TCR is preferably the SELECT-TEST state 
1612 to allow the state machine to immediately start out- 
putting SHDC, CPC, SHPSP, and CPPSP control to the scan 
distributor, scan collector, and scan path circuits. However, 
if the test architecture is hierarchical (i.e. FIG. 8), the next 

25 state from UPDATE-TCR will preferably be the IDLE state 
1604 to allow transitioning back into the SHIFT-TCR state 
to scan additional setup control data into a lower level 
controller that has been enabled, via TEO, and whose test 
control register has been inserted into the test control 

30 register scan path of the higher level controller, via SDOl 
and SDI1. 

When all setup scan operations are completed, the state 
machine responds to TPI to transition into the SELECT- 
TEST state 1612. In the SELECT-TEST state a decision can 

35 be made that will start the test by enabling the SHDC, CPC, 
SHPSP, and CPPSP outputs, or not start the test and return 
to the RESET state. Assuming it is desired to start the test, 
the state machine will respond to TPI to transition from 
SELECT-TEST to the CPC state 1614. In the CPC state, the 

40 state machine outputs CPC control to enable the scan 
collectors to capture the serial data output from the scan 
paths. After the data is captured, the state machine responds 
to TPI to transition into the SHDC state 1616 where the state 
machine outputs SHDC control to enable data to be shifted 

45 into the scan distributors and shifted out of the scan collec- 
tors. 

After remaining in the SHDC state long enough to fill the 
scan distributors and empty the scan collectors, the state 
machine responds to TPI to transition into the SHPSP state 

50 1618. In the SHPSP state, the state machine outputs SHPSP 
control to enable the scan paths to shift in data from the scan 
distributors. If the scan paths have not been filled with data 
from scan distributors, the state machine will be controlled 
by TPI to transition from the SHPSP state to the CPC state 

55 1614. If the scan paths have been filled with data from the 
scan distributors, the state machine will be controlled by TPI 
to transition from the SHPSP state into the CPPSP state 
1620. In the CPPSP state, the state machine outputs CPPSP 
control to enable the scan paths to capture data from 

60 combinational logic being tested. If the test is not complete, 
the state machine will respond to TPI to transition from the 
CPPSP state to the CPC state and repeat the above test 
control sequence. IF the test is complete, the state machine 
can respond to TPI to transition from the CPPSP state 

65 directly to the IDLE state 1064. 

As seen in FIG. 16, the state machine 1504 may also 
complete a test by transitioning into the IDLE state from the 
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CPC state 1614. At the end of a test, the state machine 
responds to TPI to transition from the IDLE state to the 
RESET state. Alternately, the state machine may enter the 
RESET state from any state in response to a low on TEL 

The state machine state diagram of FIG. 16 closely 5 
mirrors the more general descriptive state diagram previ- 
ously shown and described in regard to FIG. 6. For example, 
state 501 of FIG. 6 relates to the starting of the test, which 
in FIG, 16 relates to the TEI signal going high. State 502 of 
FIG. 6 relates to the configuring of a test, which in FIG. 16 10 
relates to the setup scan operation performed by transition 
through the SHIFT-TCR and UPDATE-TCR states 1608 and 
1610. State 503 of FIG. 6 relates to the capturing of data 
outputs from scan paths into scan collector, which in FIG. 16 
relates to the CPC state 1614. State 504 of FIG. 6 relates to is 
the filling and emptying of the scan distributors and scan 
collectors, which in FIG. 16 relates to the SHDC state 1616. 

State 605 of FIG. 6 relates to the inputting of data from 
the scan distributors to the scan paths, which in FIG. 16 
relates to the SHPSP state 1618. State 508 of FIG. 6 relates 20 
to the capturing of data by the scan paths, which in FIG. 16 
relates to the CPPSP state 1620. State 509 of FIG. 6 relates 
to exiting the test mode and returning the circuit (integrated 
circuit or core) back to the normal mode of operation, which 
in FIG. 16 relates to transitioning into the RESET state 1602. 25 

In FIG. 6, the inner loop, comprising state transitions 503, 
504, 605, and back to 503, used to capture data into the scan 
collectors from the scan paths, shift data in and out of the 
scan distributors and scan collectors, and load data into the 
scan paths from the scan distributors, relates to the inner 30 
loop of FIG. 16 comprising state transitions CPC 1614, 
SHDC 1616, SHPSP 1618, and back to CPC 1614. Also, in 
FIG. 6, the outer loop, comprising state transitions 503, 504, 
605, 508, and back to 503, used to additionally perform the 
step of capturing data from the combinational logic into the 35 
scan paths after the scan paths have been filled with data 
from the scan distributors as a result of performing the inner 
loop multiple times, relates to the outer loop of FIG. 16 
comprising state transitions CPC 1614, SHDC 1616, SHPSP 
1618, CPPSP 1620 and back to CPC 1614. 40 

In the state machine of FIGS. 15A and 16, it is seen that 
TPI is a single signal used for regulating the operation of 
multiple test control signal outputs from the state machine. 
While multiple control signals could be directly used, 
instead of having them generated by a state machine in 45 
response to a single TPI signal, it would increase the number 
of test control signal paths required to be routed to the 
controller. The advantage of having a single signal for 
regulating the operation of multiple test control outputs from 
a state machine will be seen later in regard to FIGS. 17, 18, 50 
and 19 where examples of the connectivity between hierar- 
chically arranged controllers are shown. 

Also, in the above description of the way the state 
machine controls scan distributor, scan collector, and scan 
path circuits it is important to note that the control outputs 55 
do not necessarily need to control the operations directly, but 
rather the control outputs may be used to provide timing 
windows within which the state control operation occurs. 
For example, when the CPPSP control output is generated in 
the CPPSP state, the timing to perform the capturing of data 60 
into the scan path may come from the CPPSP signal directly 
or, alternately, the CPPSP signal may simply provide a 
timing window in which a different signal is allowed to 
perform the capture operation. 

Hie different signal may for example be a functional clock 65 
signal that normally controls the registers of the scan path 
when they are in normal mode and not configured into test 
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mode as previously described in regard to FIGS. 4 and 5. 
Similarly, the other control signals, CPC, SHDC, and 
SHPSP may either directly control their stated operations, 
or, alternatively, each may provide a timing windows in 
which other signals may be allowed to control the stated 
operations. Also, if other signals are allowed to perform an 
operation, the number of times the other signals are allowed 
to perform the operation will be controlled by the number of 
TCI clocks consumed by the state machine during that 
timing window. For example, the state machine only 
remains in the SHPSP state for a single TCI clock period. If 
the SHPSP control signal enables another signal to shift the 
scan paths during the SHPSP state, the number of shifts will 
be limited to one, regardless of whether the other signal has 
a frequency much higher than the frequency of the TCI 
signal. 

Hierarchical Controller Arrangements and Operation 

In FIG. 17, multiple controllers 1702, 1704, and 1706 are 
be connected in a hierarchy. FIG. 17 relates to previous 
FIGS, 8 and 10 that depicted embedded cores, each having 
scan distributor and scan collector architectures, connected 
to form deep scan distributor and scan collector test channels 
accessible from the integrated circuits pads. As mentioned in 
regard to FIGS. 8 and 10, the integrated circuit's controller 
has authority over lower level controllers to allow the 
integrated circuit controller to establish test modes in lower 
lever controllers, via the test control register, and to syn- 
chronize the test operations of lower level controllers to the 
integrated circuit level controller. As previously mentioned, 
each of the embedded cores may have been an integrated 
circuit prior to being utilized as an embedded core. Each 
core therefore has the same controller inputs and outputs as 
would an integrated circuit controller, i.e. TPI, TCI, TEI, 
SDI, SDO, SDOl, TEO, and SDI1. Even if the embedded 
core were not previously integrated circuits, they would 
preferably be designed with these same controller inputs and 
outputs to allow hierarchically connecting the cores together 
as described below. 

FIG. 15 A has provided a detail view and description of the 
controller. FIG. 17 illustrates how the hierarchical intercon- 
nect structure between an integrated circuit controller 1702, 
a core 1 controller 1704 embedded within the integrated 
circuit, and a core 2 controller 1706 embedded within core 
1 is accomplished. The integrated circuit controller 1702 is 
connected to the integrated circuit pads via the previously 
described TPI, TCI, TEI, SDI, and SDO signals. The inte- 
grated circuit controller's SDOl output is connected to the 
SDI input of the core 1 controller 1704. The integrated 
circuit controller's SDI1 input is connected to the SDO 
output of the core 1 controller 1704. The integrated circuit 
controller's TEO output is connected to the TEI input of the 
core 1 controller 1704. Core 1 controller's SDOl output is 
connected to the SDI input of the core 2 controller 1706. 
Core 1 controller's SDI1 input is connected to the SDO 
output of the core 2 controller 1706. Core 1 controller's TEO 
output is connected to the TEI input of the core 2 controller 
1706. The TPI and TCI inputs to both core 1 and core 2 
controllers are directly connected to the integrated circuit's 
TPI and TCI pads, as is the TPI and TCI inputs to the 
integrated circuit controller. This hierarchical interconnect 
structure would continue if additional embedded cores were 
present in core 2, 

Based on the hierarchical interconnect structure descrip- 
tion given above, the steps of hierarchically accessing the 
controllers within the interconnect structure will now be 
described. Initially, all controllers will be reset by the TEI 
pad input being low. From the controller description of FIG. 
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ISA, if the TEI input is low, the TEO output will be low. 
Therefore, all controllers will be reset by a low on the TEI 
pad, due to the TEO and TEI connections between the 
controllers. If TEI is set high, a first scan operation of the 
integrated circuit controller's test control register can be 
performed. This first scan operation sets the TEI input to the 
core 1 controller 1704 high, and also selects core 1 to be 
inserted into the integrated circuit controller's scan path, via 
SDOl and SDH. A second scan operation is performed 
which scans data through both the integrated circuit and core 
1 test control registers. This second scan operation sets the 
TEI input to the core 2 controller 1706 high, and also selects 
core 2 to be inserted into the core 1 controller's scan path, 
via SDOl and SDI1. A third scan operation can now be 
performed to load control data into all test control registers 
of the integrated circuit, core 1, and core 2 controllers to 
begin a test. 

In this description, multiple test control register scan 
operations have been used to enable multiple embedded 
controllers to be added to the scan path of the integrated 
circuit controller. This multiple scan operation is facilitated 
by the design of the state diagram of FIG. 16, which 
provides a loop between the IDLE, SELECT-TCR, SHIFT- 
TCR, UPDATE-TCR, and IDLE states 1604, 1606, 1608, 
1610, and 1604. In this loop, the IDLE state serves as a 
synchronization state for adding test control registers of 
lower level controllers to the test control registers of higher 
level controllers. For example, when the TEO output from a 
higher level controller is input to the TEI input of a lower 
level controller during the UPDATE-TCR state, the lower 
level controller is enabled to follow the TPI input. Thus, as 
the enabling higher level controller transitions to IDLE from 
UPDATE-TCR in response to a low on TPI, the enabled 
lower level controller transitions to IDLE from RESET, also 
in response to the low on TPI. As a result, synchronization 
occurs between the enabling and enabled controllers by 
having both transitioning to the IDLE state, as shown in the 
state diagram of FIG, 16. 

In FIG. 17, the TPI and TCI pad signals are bussed 
directly to each controller 1702, 1704, and 1706. This allows 
the timing of each controller to be better maintained during 
scan and test operations, regardless of how many controllers 
are hierarchically connected. If for example, the TPI and 
TCI signals were routed through each controller prior to 
being input to a lower level controller, instead of being 
directly input to each controller from the integrated circuit 
pads, delays would accumulate in the TPI and TCI signal 
paths as more controllers are hierarchically connected. 
Eventually, the accumulated delay would reach a point 
where the timing of scan and test operations of lower level 
controllers would be degraded and the controllers would no 
longer be synchronized to the TPI and TCI pad signals. This 
would cause the test hierarchy to support only a limited 
number of connected controllers. However, by keeping the 
TPI and TCI pad signals directly bussed to all controllers, 
and sufficiently buffered to drive all controllers, the test 
hierarchy would not have the stated controller connection 
limit. 

While buffer circuitry is not shown in FIG. 17 and other 
figures, external signals input to the integrated circuit pads 
will be sufficiently buffered or otherwise amplified to allow 
them to drive the internal circuits they are connected to. U.S. 
Pat. No. 5,056,093 by Whetsel, column 20, lines 31-46, 
describes the advantage of directly bussing control signals. 

In FIG. 17, an advantage in wire routing in the integrated 
circuit results from using a single TPI signal to each con- 
troller to generate a plurality of control outputs used to 
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operate the scan distributor, scan collector, and scan path 
circuits. For example, if the control signals that operate the 
scan distributor, scan collector, and scan path circuits were 
not supplied by the state machine, but rather were supplied 

5 from integrated circuit pads and gated by control output 
from the test control register to all scan distributor, scan 
collector, and scan path circuits, the test control wire routing 
overhead in the integrated circuit would increase signifi- 
cantly. Furthermore, it is easier to design and route a single 

io signal path for minimum signal degradation and skew than 
it is to design and route multiple signal paths for minimum 
signal degradation and skew. Also, it is usually preferred to 
keep the number of integrated circuit pads dedicated for test 
to a minimum number, which the single TPI signal allows. 

15 In FIG. 18, another controller connection scheme is 
shown. Integrated circuit 1800 provides an integrated circuit 
controller 1802 connected to a selected one of a plurality of 
controllers 1804, 1806 via multiplexer circuitry 1808. This 
connection scheme was previously described in regard to 

20 FIG. 12, and is used whenever the number of core terminals 
used for testing consume most of the available integrated 
circuit pads, such that no other core can be tested simulta- 
neously due to lack of available pads. To select one of the 
cores 1804, 1806, the integrated circuit controller 1802 is 

25 scanned a first time to load the test control register to output 
mux control to the multiplexer circuitry and to insert the 
SDOl and SDI1 signals of the multiplexer to the integrated 
circuit controller's test control register scan path. In 
response to the mux control output, the multiplexer cir- 

30 cuitry; (1) connects the SDOl output from the integrated 
circuit controller to the SDI input of the selected core 
controller, (2) connects the SDI1 input of the integrated 
circuit controller to the SDO output of the selected core 
controller, and (3) connects the TEO output of the integrated 

35 circuit controller to the TEI input of the selected core 
controller. The multiplexer circuitry is designed to drive the 
TEI inputs of non-selected cores low, so that they are held 
in a reset state when they are not selected by the integrated 
circuit controller 1802. 

40 With this connection forming between the integrated 
circuit controller 1802 and the core controller, a second scan 
operation is performed which loads test control data into the 
test control registers of the integrated circuit and core 
controllers, to establish the test mode to be used. Following 

45 the second scan operation, the test can begin by inputting 
TPI control to the integrated circuit and core state machines 
to generate the control outputs to operate the scan distributor 
and scan collector architectures of the integrated circuit and 
core. When testing is complete, a third scan operation is used 

50 to deselect the core controller from the integrated circuit 
controller. If another core needs to be tested, the above 
sequence can be repeated to select, setup, and test the other 
core. If no other core needs to be tested, the TEI signal can 
be taken low to reset and disable all the controllers within 

55 the integrated circuit 1800, and the multiplexer circuitry 
1808. 

In FIG. 18, the TPI and TCI signals are bussed directly 
from the integrated circuit pads to the controllers to provide 
the timing and signal integrity advantage previously men- 

60 tioned in regard to FIG. 17. While the TPI and TCI signals 
could be connected to the selected core via the multiplexer 
circuitry 1808, as shown in FIG. 12, a directly bussed 
connection is preferred to avoid the delay introduced by the 
multiplexer circuitry. If the selected core contains further 

65 embedded cores, those cores can be selected, setup, and 
tested using the hierarchical connection approach described 
in regard to FIG. 17. 
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The structures depicted in FIG. 19 relates to the structures 
depicted in previous FIG. 14 where multiple cores and 
non-core circuits were described being directly connected to 
integrated circuit pads and tested in parallel. The difference 
between the structures of FIGS. 14 and 19 is that in FIG, 14 
each circuit's controller was connected to separate inte- 
grated circuit pads to allow each controller to be indepen- 
dently controllable, whereas in FIG. 19, all controllers 1902, 
1904, and 1906 are connected in the hierarchical fashion 
described in regard to FIG. 17 to allow one set of integrated 
circuit pads to control all circuit controllers. The process for 
setting up the hierarchically arranged controllers is the same 
as described in regard to FIG. 17. 

A difference between FIG. 17 and 19 is that in FIG. 17 the 
test data input to and output from each circuit passes through 
a serial pipeline connection formed via the circuit's scan 
distributor and scan collector circuits, as seen in FIG. 10. In 
FIG. 19 the test data input to and output from of each 
circuit's scan distributor and scan collector circuits is pro- 
vided by direct connection to integrated circuit pads. If the 
cores of FIG. 19 contain embedded cores, hierarchical 
testing as described in regard to FIGS. 10 and 17 can be 
performed. During scan and test operations, all controllers 
operate synchronous to the TPI and TCI pad inputs, which 
are shown directly connected to each controller. 
Pipelining Test Data through Scan Distributor and Scan 
Collector Circuits 

It is important to note that when multiple levels of scan 
distributor and scan collector circuits are connected to form 
deep serial test data input and output pipelines, as depicted 
to FIGS. 8, 10 and 11, the controller associated with each 
scan distributor, scan collector, and scan path circuit level 
can be setup, by scanning of the controller's test control 
register of FIG. ISA, to control the operation of the scan 
distributor, scan collector, and scan path circuits. As men- 
tioned in regard to FIGS. 8, 10, and 11, the controllers at 
each level can setup their scan distributor, scan collector, and 
scan path circuits for testing, or the controllers can setup 
their scan distributor and scan collector circuits as pipeline 
registers to transfer test data between integrated circuit pads 
and scan distributor, scan collector, and scan path circuits 
that are setup for testing. If scan distributor and scan 
collector circuits are being used as pipeline registers, the 
controller associated with the scan distributor and scan 
collector circuits does not output the CPC control signal 
previously described in FIGS. ISA and 16. The CPC signal 
causes the scan collector to capture data from the scan paths. 
When pipelining data through the scan collector to be output 
from an integrated circuit pad, the data must not be 
overwritten, as would occur if the CPC signal were output. 
The following example is given to illustrate how data 
pipelining preferably works when intermediate scan dis- 
tributor and scan collector circuits, not being used for 
testing, exists between a tester and scan distributor, scan 
collector, and scan path circuits that are being used for test. 

In FIG. 11, the level 4 scan distributor, scan collector, and 
scan path circuits have been setup for testing and the scan 
distributor and scan collector circuits at levels 1-3 have been 
setup for pipelining data between a tester contacting the 
integrated circuit pads and the level 4 scan distributor, scan 
collector, and scan path circuits. The following steps are 
performed during this test. 

The first step is to input 40 bits of data into the four 
serially connected 10-bit scan distributors and output 40 bits 
of data from the four serially connected 10-bit scan collec- 
tors during the SHDC state (see FIG. 16 for all state 
references). After this first step, the level 4 scan distributor 
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is loaded with the first 10 bit data pattern to be shifted into 
the level 4 scan paths during the SHPSP state. Also follow- 
ing this first step, the level 1-3 scan distributors have been 
loaded with the next three 10 bit patterns that will eventually 

5 be shifted into the level 4 scan distributor and transferred 
into the level 4 scan paths. 

The second step is to transfer the 10-bit pattern from the 
level 4 scan distributor into the scan paths during the SHPSP 
state, then capture the 10-bit data output from the scan paths 

10 into the level 4 scan collector during the CPC state. Note that 
since level 1-3 scan collectors have been setup to operate as 
pipeline registers, their controllers do not output the CPC 
control during the CPC state. As previously mentioned, 
outputting CPC control to level 1-3 scan collectors would 

15 overwrite data being pipelined to the tester from the level 4 
scan collector circuit. 

The third step is to input 10 bits of data into the four 
serially connected scan distributors, and output 10 bits of 
data from the serially connected scan collector circuits in the 

20 SHDC state. Following this step, the level 1 scan distributor 
contains a new 10 bit data pattern from the tester, the level 
2 scan distributor contains the 10 bit data pattern previously 
in the level 1 scan distributor, the level 3 scan distributor 
contains the 10 bit data pattern previously in the level 2 scan 

25 distributor, and the level 4 scan distributor contains the 10 
bit data pattern previously in the level 3 scan distributor. 
Also following this step, the tester contains the 10 bit data 
pattern previously in the level 1 scan collector, the level 1 
scan collector contains the 10 bit data pattern previously in 

30 the level 2 scan collector, the level 2 scan collector contains 
the 10 bit data pattern previously in the level 3 scan 
collector, and the level 3 scan collector contains the 10 bit 
data pattern previously in the level 4 scan collector. 
Following the third step, the second step is repeated, then 

35 the third step is repeated. This sequence of repeatedly doing 
the second and third steps continues until the level 4 scan 
paths are filled with data, at which time a fourth step of 
capturing data into the level 4 scan path during the CPPSP 
state occurs. Following the fourth step, the sequence of 

40 repeatedly doing the second and third steps continues, 
periodically performing the fourth step as the level 4 scan 
paths fill with data. Eventually the test is complete and the 
controllers are transitioned back into their RESET states. 
Note that the last scan distributor and scan collector shift 

45 operation in the third step needs to be of sufficient duration 
to allow the last data captured into the level 4 scan collector 
from the scan paths to be communicated to the tester. 

What is important understand in the above example is the 
ability of non-testing scan distributor and scan collector 

50 circuits, located between a tester and testing scan distributor 
and scan collector circuits, to serve as pipeline registers 
which provide temporary storage for data being transferred 
between the tester and testing scan distributor and scan 
collector circuits. For example, in step one above the tester 

55 had to initially transfer 40 bits of data into and out of the 
level 1-4 scan distributor and scan collector circuits, to get 
the first 10 bit data input and output pattern to and from the 
level 4 scan distributor and scan collector circuits. If the 
level 1-3 scan distributor and scan collector circuits could 

60 not be controlled to operate in the pipeline mode, this 40 bit 
transfer would have to occur each time a new 10 bit data 
input and output pattern is required to be transferred 
between the tester and level 4 scan distributor and scan 
collector circuits. In this example, this would extend the test 

65 time by approximately a factor of 4. However, since the level 
1-3 scan distributor and scan collector circuits can be 
controlled to operate in a pipeline mode during steps 2 and 
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3, the tester only has to communicate 10 bits of data during 
each SHDC state. Again, the key to this is the ability to scan 
the test control registers of the level 1-3 controllers to cause 
their state machines to not output the CPC control signal 
during the CPC state. 

While this example uses a 40 bit deep pipeline, other 
examples may have shorter or longer pipelines, depending 
upon the depth of the scan distributor and scan collector test 
hierarchy being tranversed. Using the pipelining approach 
described above advantageously cancels out the depth of any 
scan distributor and scan collector test hierarchy, and 
enables the tester to be viewed as being directly connected 
to the testing scan distributor and scan collector circuitry, 
regardless of the pipeline bit length between the tester and 
testing scan distributor and scan collector circuitry. Being 
able to test a circular, a core for example, embedded N levels 
deep in approximately the same amount of time as it would 
take to test the same circuit in a non-embedded or direct 
fashion is a very important aspect of the present invention. 

Another important aspect of the pipelining capability is 
the fact that the test patterns used to test the embedded 
circuit in the example above, are the same test patterns used 
to test the same circuit if it were not embedded. The only 
difference in the test patterns is that they must be temporarily 
registered along the additional non-testing scan distributor 
and scan collector circuits prior to being input and output to 
the target circuit being tested via the testing scan distributor 
and scan collector circuits. 
Test Pattern Formatting 

In conventional scan path design, test patterns are for- 
matted to allow a tester to scan directly into and out of a scan 
path, as described in regard to FIG. 2. However, using the 
present invention the test patterns need to be formatted 
differently to allow navigating the test patterns through scan 
distributor and scan collector circuits located between the 
tester and scan paths. 

In FIG. 20A, system 2000 provides an integrated circuit 
2002, a tester driver 2004 and a tester receiver 2006. 
Integrated circuit 2002 contains a simplified example of a 
pair of scan paths, scan path 1 2008 and scan path 2 2010, 
interfaced to the tester driver and receiver channel by 2 bit 
deep scan distributor and scan collector circuits 2012, 2014. 
While the ideal case would be for all scan paths to be of the 
same bit length, that may not always be the case. To illustrate 
how scans to different length scan paths are performed using 
the present invention, scan path 1 is shown being 5 bits long 
and scan path 2 is shown being 4 bits long. The tester driver 
2004 comprises a shift register means 2016 for transmitting 
data to the scan distributor and scan paths, a memory means 
2018 for storing data transmitted by the shift register, and a 
control means 2020 for regulating the operation of the shift 
register and memory. The tester receiver 2006 comprises a 
shift register means 2022 for receiving data from the scan 
collector and scan paths, a memory means 2024 for storing 
data received by the shift register, and a control means 2026 
for regulating the operation of the shift register and memory. 

To support the scan distributor circuit interface between 
the tester and scan paths, the data stored in the driver 
memory is formatted into left and right columns. In the 
driver memory 2018, the data shown in the left column 
(D1-D5) is the data that would normally be shifted into scan 
path 1, if scan path 1 was conventionally connected directly 
to a tester driver. Similarly, the data (D1-D4) in the right 
column of the driver memory is the data that would normally 
be shifted into scan path 2, if scan path 2 was conventionally 
connected directly to another driver. However, since scan 
path 1 and scan path 2 are interfaced to the same tester 



driver, via scan distributor, the data output to scan path 1 and 
scan path 2 is formatted into left and right columns as 
shown. 

To support the scan collector circuit interface between the 

5 tester and scan paths, the data stored in the receiver memory 
is formatted into left and right columns. In the receiver 
memory 2024, the data shown in the right column (D1-D5) 
is that data that would normally be shifted out of scan path 
1, if scan path 1 was conventionally connected directly to a 

10 tester receiver. Likewise, the data (D1-D4) in the left 
column of the receiver memory is the data that would 
normally be shifted out of scan path 2, if scan path 2 was 
conventionally connected directly to another tester receiver. 
However, since scan path 1 and scan path 2 are interfaced to 

15 the same tester receiver, via scan collector, the data input 
from scan path 1 and scan path 2 is formatted into left and 
right columns as shown. 

In operation, the driver's controller loads the shift register 
with the first row of left and right column data, i.e. D5 and 

20 X, from the driver memory. The controller then causes the 
shift register to shift the left and right column data (D5 and 
X) into the scan distributor, such that X inputs to scan path 
2 and D5 inputs to scan path 1. The scan paths are then 
shifted to input the D5 and X. Next, the second row of left 

25 and right column data in the memory, D4 and D4, is 
similarly loaded into the shift register, shifted into the scan 
distributor, then shifted into the scan paths. This process 
repeats with subsequent rows of left and right column data 
until the scan paths have been filled, such that scan path 1 

30 contains D1-D5 and scan path 2 contains D1-D4. The X bit 
in the first row of data shifted out is a placeholder that serves 
to pad or balance the data being shifted into the uneven 
length scan paths. If the scan paths had even lengths, the X 
would not be required. 

35 Simultaneous with the above described driver operation, 
the receiver's controller operates the receiver shift register to 
shift in data from the scan collector, as scan distributor is 
being shifted, such that data that has been captured into scan 
collector from scan path 1 is stored into the right column of 

40 the receiver memory and data that has been captured into 
scan collector from scan path 2 is stored into the left column 
of the receiver memory. At the end of the above described 
driver output operation, where its memory has output rows 
of left and right column data to the scan distributor, the 

45 receiver memory will have filled with rows of left and right 
column data from the scan collector. The left column of the 
receiver memory is filled with scan path 2 data (D1-D4), 
and the right column is filled with scan path 1 data (D1-D5). 
Again due to the uneven length between scan path 1 and 

50 scan path 2, the last data input to the left column of the 
receiver memory from scan path 2 will be X. With even 
length scan paths, no X's would be input to the receiver 
memory. 

After the above described shift in and out sequence 
55 occurs, the next formatted data to be shifted out to the scan 
paths is available in the driver memory, and new locations 
of receiver memory are available for storing data shifted out 
of the scan paths during the next sequence. 

In FIG. 20B, system 2050 provides an integrated circuit 
60 2052, tester driver 2054 and tester receiver 2056. This 
embodiment illustrates another example of data formatting 
where the tester must communicate with scan distributors 
and scan collectors pairs of uneven length. Scan distributor 
2058 and scan collector 2060 pair connect to two scan paths 
65 2062 and 2064, as described above. Scan distributor 2066 
and scan collector 2068 pair connect to three scan paths 
2070, 2072, and 2074, each having different lengths. The 
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basic operation is the same as described in FIG. 20A, with 
the exception that a third column of data must be formatted 
for the driver and receiver memories. This third column is 
used for inputting and outputting data to the third scan path 
2074 of the scan distributor and scan collector pair 2066, 5 
2068. While the scan distributor and scan collector pair 
2058, 2060 does not have a third scan path to communicate 
to, its driver and receiver memories are formatted to include 
a third column of X's to pad or balance the data input and 
output communication of the scan distributor and scan 10 
collector pairs 2058, 2060 and 2066, 2068. So, while uneven 
length scan paths require padding bits as described in regard 
to FIG. 20 A, uneven length scan distributor and scan col- 
lector pairs require padding columns as depicted in FIG. 
20B. It is important to note that even though the scan 15 
distributor and scan collector pair 2058, 2060 is not testing 
as efficiently as it was in FIG. 20A due to the additional 
shifts required for the X bits and X columns, it is testing 
while the scan distributor and scan collector pair 2066, 2068 
is testing. 20 
Power Reduction Advantage 

During scan testing, conventional scan paths, as described 
in regard to FIG. 2, shift data in and out at the frequency of 
the scan clock. As previously described in regard to FIG. 4, 
a scan path inputs data to and receives data from combina- 25 
tional logic being tested. Thus, in a conventional scan path, 
as data shifts through the scan path the inputs to the 
combinational logic from the scan path may transition at the 
scan clock frequency. Transitioning the inputs of the com- 
binational logic consumes power which produces heat in the 30 
integrated circuit. The amount of power consumed is related 
to the frequency of the input transitions to the combinational 
logic, which is related to the scan clock frequency shifting 
data through the scan path. 

In FIG, 21 A, integrated circuit 2100 includes a section of 35 
a scan distributor 2102 and scan collector 2104 circuit 
connecting two scan paths 2106, 2108. The scan paths input 
data to and receive data from combinational logic 2110, via 
functional input (FI) and functional output (FO) signals. 
Each scan path comprises a series of connected conventional 40 
scan cells, as shown in FIG. 21B. In the scan paths, the 
dotted boxes indicate the presence of the scan cell and the 
connection of the scan cells to each other, via serial input 
(SI) and serial output (SO), and to the combinational logic, 
via FI and FO. The test mode operation of the FIG. 2 IB scan 45 
cell to capture FI data and shift data from SO to SI is well 
known in the art of scan testing. While only two scan paths 
are shown in FIG. 21 A, the scan distributor and scan 
collector circuits may be connected to many additional scan 
paths, each scan path being similarly connected to combi- 50 
national logic via FI and FO. 

In FIG. 21A, each scan cell provides outputs to the 
combinational logic, via FO, and to the next scan cell's SI 
input, via SO, except for the last scan cell which outputs to 
the combinational logic and the scan collector. FO and SO 55 
are the same node. There are techniques used to isolate FO 
and SO during scan so that FO is static while SO outputs. 
However, these techniques require adding and inserting 
circuitry, such as latches or gates, in the FO signal path 
between the scan cell output and input to the combinational 60 
logic and controlling the added circuitry to update at the end 
of each scan operation. Examples of such isolation circuitry 
is described in IEEE 1149,1 standard. The first scan cell's SI 
input is connected only to the scan distributor 2102. Each 
FO to the combinational logic is shown fanned out to many 65 
combinational logic inputs, which is typical. Capacitor CI 
represents the capacitive load of the SI input of the scan cell 



driven by the scan distributor. Capacitor C2 represents the 
capacitive load of combinational logic inputs driven by FOs. 
Capacitor C3 represents the combined capacitive load asso- 
ciated with the all gate interconnects within the combina- 
tional logic. Capacitance CI is small compared to capaci- 
tances C2 and C3. 

If the scan paths 2106, 2108 were connected to integrated 
circuit pads as conventional scan paths, instead of to scan 
distributor and scan collector circuits, they would scan data 
at the scan clock input frequency. If the scan clock frequency 
were 100 MHz, and alternating data bits were shifted 
through the scan path during each scan clock period, the FO 
outputs from the scan paths would transition at 100 MHz. 
This means that each C2 combinational logic input load 
would charge and discharge at that frequency. Also, the 
capacitance C3 gate interconnect load will charge and dis- 
charge in response to the transitions at the combinational 
logic inputs. The power consumed during test by the charg- 
ing and discharging of capacitances C2 and C3 increases as 
scan clock frequency increases and decreases as scan clock 
frequency decreases, 

When using the scan distributor and scan collector circuits 
2102, 2104 to scan data through the scan paths, the power 
consumed during test is reduced, compared to the conven- 
tional scan description above, since the scan clock frequency 
of the scan path circuits is reduced. For example, operating 
10 bit scan distributors and scan collectors 2102, 2104 using 
a 100 Mhz clock and according to the previously described 
inner loop of the controller state diagram of FIG. 16 (i.e. the 
state transition loop from the CPC state to the SHDC state 
to SHPSP state and back to CPC state) will result in 
transitioning through the SHPSP state once every twelve 100 
Mhz clock cycles. Since SHPSP is the state that scans data 
into scan paths 2106, 2108 from the scan distributor 2102, 
the scan paths are scanned at frequency of 100 Mhz/12 or 8.3 
Mhz. Charging and discharging the capacitances C2 and C3 
loads at this slower frequency reduces the power consumed 
during test. Operating the scan distributor at 100 MHz will 
cause the capacitance CI load it drives to charge and 
discharge at 100 MHz, but since capacitance CI is small 
compared to capacitances C2 and C3, the power consumed 
is negligible. Also, as previously described in regard to FIG. 
6, the test time is not significantly decreased when using the 
scan distributor and scan collector circuits 2102, 2104 since 
an amplified number of smaller length scan paths are 
capable of being used to transmit test data to and from 
combinational logic being tested. 

The following example illustrates the power reduction 
possible using scan paths connected to 10 bit scan distribu- 
tors and scan collectors as shown in FIGS, 4 and 21 A, rather 
than using conventional scan paths connected to pads as 
shown in FIG. 2. The power consumed by C2 and C3 of 
combinational logic 2100 if connected to conventional scan 
path 1 200 of FIG. 2 during scan operations can be estimated 
by; P-CV^F, where C is the lumped combinational logic C2 
and C3 capacitance described in FIG. 21A, V is the IC 
voltage, and F is the frequency of the scan path FO outputs. 
The power consumed after modifying scan path 1 of FIG, 2 
into a group of 10 shorter length scan paths of FIGS. 4 and 
21Aand connecting them to 10 bit scan distributors and scan 
collectors, as described in regard to FIG. 3, can be estimated 
by; P«CV 2 (F/12), where C is again the lumped combina- 
tional logic C2 and C3 capacitance, V is again the IC 
voltage, and (F/12) is the scan frequency of the modified 
scan path FO outputs (i.e. (F/12) is the frequency of SHPSP 
state transitions in the inner loop as described above). 

From this example it is seen that for a given C, V, and F, 
the power consumed by combinational logic 2110 being 
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scan tested using scan paths modified for connection to scan 
distributor and scan collector circuits of FIGS. 4 and 21A is 
approximately Vi2 the power consumed by the same com- 
binational logic 2110 if it were scan tested using the con- 
ventional scan path arrangement of FIG. 2. 5 

It is important to note that test power consumption 
decreases further as the depth of the scan distributor and 
scan collector increases, since the frequency of the SHPSP 
state in the inner loop of FIG. 16 state diagram decreases. 
For example, with 40 bit deep scan distributors and scan 1Q 
collectors connected to forty 25 bit scan paths, appropriately 
modified from the FIG. 2 scan path as described in FIG. 3, 
the power can be estimated by; P«CV 2 F/42, where F/42 is 
the frequency of the scan paths FO outputs (i.e. the fre- 
quency of the SHPSP state of the inner loop of FIG. 16). 
From this example it is seen that the power consumed by 35 
combinational logic 2110 being scan tested using 40 bit scan 
distributors and collectors and appropriately modified scan 
paths is approximately V42 the power consumed by the same 
combinational logic 2110 if it were scan tested using the 
conventional scan path arrangement of FIG. 2. 20 

It is also important to note that as the depth of scan 
distributors and scan collectors increase, the scan cycle time 
of scan distributor and scan collector arrangements approach 
the scan cycle time of conventional scan path arrangements, 
as described previously in regard to FIGS, 4 and 6 above. 25 
Therefore increasing the depth of scan distributors and scan 
collectors advantageously reduces both IC test power con- 
sumption and IC test time. 

In FIG. 22 A, arrangement 2200 is used to further reduce 
the power consumed during testing. In the description of 30 
FIGS. 21A and 21B the transitioning of the scan path's FO 
outputs was described to occur in response to a single scan 
clock used to shift data through the scan path. Using the 
same scan clock to shift data through all scan paths causes 
simultaneous transitions on the FO outputs of all the scan 35 
paths. This causes all the capacitive C2 and C3 loads driven 
by FO outputs to be charged or discharged simultaneously, 
which will consume the most power. 

As described previously in regard to FIGS. 15A and 16, 
the SHPSP state outputs a SHPSP signal which can either 40 
shift the scan paths directly or serve as a timing window to 
enable another signal to shift the scan paths. The example in 
FIG. 22A provides for the SHPSP signal to be used as a 
timing window to allow another signal to produce a strobe 
that clocks the scan path circuits. The SHPSP and other 45 
signal 2202, a functional clock for example, axe input to a 
synchronizer circuit 2204. The synchronizer is enabled by 
SHPSP being high to allow one of the clock pulses of the 
other signal to pass through to the synchronizer's strobe 
output. Note that the design of the synchronizer only allows 50 
one clock pulse to be output on the strobe output 2206 even 
though the signal produces multiple clock pulses during the 
SHPSP timing window (i.e. while SHPSP is high). In other 
states, such as SHDC, where the state machine remains for 
longer than one TCI clock, a new timing window will be 55 
produced for each TCI clock that occurs during the state. 
However, as described above, only one strobe output will be 
produced within each new timing window. 

If the strobe were directly input to all scan paths, all the 
scan paths would shift at the same time. This would produce 60 
the simultaneous charge or discharge situation mentioned 
above. To prevent this, the strobe is input to a series of 
buffers 2208, 2210, 2212, and 2214 connected such that the 
output of the first buffer drives scan path 1 and the input of 
the second buffer, the second buffer drives the input of scan 65 
path 2 and the input of the third buffer, and so on until the 
last buffer drives only the last scan path. 
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The power reduction made possible by the scan distribu- 
tor and scan collector architecture alone, or in combination 
with the synchronizer and delay circuitry described in FIG. 
22A, enables more circuits in an IC to be tested in parallel. 
For example, if an IC contains multiple circuits to be tested, 
it is preferable to test all the circuits in parallel to reduce the 
IC's test time, which reduces wafer and IC manufacturing 
cost. However, if each circuit uses conventional scan design 
it may not be possible to test all circuits in parallel since the 
power consumed by simultaneously testing all circuits may 
exceed the ICs power handling capacity. Therefore, using 
conventional scan design, the test time of an IC may increase 
since circuits in the IC may need to be tested one at a time 
to limit the test power consumption. However, using the 
scan distributor and scan collector architecture it may be 
possible to test all circuits in an IC in parallel and therefore 
reduce IC test time, which reduces costs. 

FIG. 22B depicts the timing of these series connected 
signals. 

Each scan distributor and scan collector pair could have 
its own synchronizer and clock skewing buffer arrangement 
2200, or one synchronizer and clock skewing buffer arrange- 
ment could be used for all scan distributor and scan collector 
pairs. Alternately, one synchronizer could be used to provide 
a common strobe signal to multiple clock skewing buffer 
arrangements. Using this approach, the shifting of data 
through each scan path will be staggered in time. Therefore 
the transitions on the FO outputs of each scan path will be 
staggered in time, as will the charging and discharging of 
capacitances C2 and C3 driven by the FO outputs. Simul- 
taneous power consumption will therefore be reduced. 

While the example circuit of FIG. 22 A is shown produc- 
ing skewed strobe outputs to reduce the simultaneous power 
consumed by the scan paths in a given test timing window, 
the circuit could also be used in normal functional operation 
to reduce simultaneous power consumed by functional reg- 
isters in a given functional timing window. 
Test Controller Programming for Optimized Testing of Par- 
ticular Circuits 

It is important to note that while the scan distributor and 
scan collector architecture has been shown testing combi- 
national logic, other types of circuits can be tested as well, 
including memories, such as RAMs, and mixed signal 
circuits, such as digital to analog converters (DAC) and 
analog to digital converters (ADC). The following examples 
illustrate how the testing of other types of circuits is accom- 
plished using the scan distributor and scan collector archi- 
tecture. 

Improved Testing of Embedded Memory Cores 

In FIG. 23A-1, integrated circuit 2300 includes a RAM 
memory 2302 connected, in test mode, to two scan distribu- 
tor circuits 2304, 2306 and a scan collector circuit 2308. The 
scan distributor and scan collector circuits are connected, in 
test mode, to integrated circuit pads or core terminals 2310, 
2312, 2314. One of the scan distributor circuits 2304 pro- 
vides data input (DI) to the RAM and the other scan 
distributor circuit 2306 provides address input (AI) to the 
RAM. The scan collector circuit 2308 provides data output 
(DO) from the RAM. 

FIG. 23A-2 shows how the SELECT-TEST portion 2316 
of the state diagram of FIG, 16 is programmed to operate 
when testing the RAM. The programming control of the 
state machine is input to the state machine from the test 
control register, as mentioned previously in regard to FIG. 
15A. 

While many types of RAM test sequences can be pro- 
grammed into the state machine, this test is programmed to 
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repeat the steps of addressing the RAM, writing data to the repeat the steps of inputting analog input to the ADC, 

addressed location, then reading back the data written into converting the analog input into a digital output, and out- 

the addressed location. At the beginning of the test, the state putting the digital output to a tester for inspection. At the 

machine enters the SHDC state 2318 (from the SELECT- beginning of the test, an analog input from an external tester 

TEST and Read states) to shift data and address into the scan 5 ^ mput t0 the ADC via a pad/terminal Next, the state 

distributors, and data from the scan collector. Next, the state machine enters the Convert state 2360 (from the SELECT- 

machine enters the Write state 2320 to store data shifted into j^j state) t0 causc the ADC t0 invert the analog input 

the scan distributor into the RAM location addressed by the mt0 ^ QUt t Jne ^ machine remains in me Convert 

address shifted into the other scan d^tnbuton Next, he state &mt { h for me conversioD t0 take lace . Next> tne 

machine enters the Read state 2322 to read back the data Jrt „ 4 u* * *u ™^ . * «^ . 

from the addressed location into the scan collector. The data 10 machin t e " C? f state M 2362 k to capture the 

read back should equal the data written. digital output mto he scan collector. Next the state machine 

During the Write state the controller outputs control to the en Jf r the SDlfl coUector < SHC ) state 2364 t0 shlfl the ^ n 

RAM to write data. During the Read state the controller collector to output the digital output to an external tester, 

outputs control to the RAM to read data and also outputs P rocess re P eats """1 all digital output codes, represen- 

control to cause the scan collector to capture the data being 15 tative of the applied analog inputs, have been output from 

read. This process of shifting the scan distributors and scan me t0 & c tester - The test can repeat with a different 

collector to input data and address to and output data from analog input signal and resulting digital outputs if desired, 

the RAM, in combination with appropriately controlling the At the end of this and the other two test examples above, the 

RAM to write and read data, repeats until all RAM locations state machine returns to either the IDLE or RESET state as 

have been written to and read from. The test can repeat with 20 shown in FIG. 16. 

another set of data to be written and read into each address. It is important to note in the above examples, that if the 

Improved Testing of Embedded Mixed Signal Cores RAM, DAC, or ADC is a core embedded deep inside an 

In FIG. 23B-1, an integrated circuit 2330 includes a integrated circuit, the previously described method of pipe- 
digital to analog converter (DAC) 2332 connected, in test lining data, can be used to improve digital test data band- 
mode, to a scan distributor circuit 2334 at its digital input 25 width to and from the circuits. Also note that if pipelining is 
and to an integrated circuit pad or core terminal 2336 at its used, the controllers of the intermediate scan distributor and 
analog output. The scan distributor circuit 2334 is scan collector circuits, that pipeline the data, must be 
connected, in test mode, to an integrated circuit pad or core programmed to operate according to the state diagrams of 
terminal 2338. The scan distributor circuit 2334 provides the FIGS. 23A-1, 23B-1, and 23C-1 for synchronous operation, 
digital input to the DAC and the analog output provides 30 Further, the pipelining controllers of intermediate scan dis- 
analog output from the DAC. tributor and scan collector circuits must disable the capture 

FIG. 23B-2 shows how the SELECT-TEST portion 2340 collector signals during the read state 2322 of 23A-2 and 

of the state diagram of FIG. 16 is programmed to operate CPC state 2362 of 23C-2 to avoid overwriting data being 

when testing the DAC. pipelined, as previously mentioned. As mentioned previ- 

While many types of DAC test sequences can be pro- 35 ously in regard to FIGS. 15 A, 16, and 22A, the control 
grammed into the state machine, this test is programmed to signals output from the controllers may control testing 
repeat the steps of inputting digital data to the DAC, directly or may operate as timing windows within which 
converting the digital data into an analog output, and out- another signal may be enabled to control testing, 
putting the analog output to a tester for inspection. At the Hierarchical Routing of Analog Test Signals 
beginning of the test, the state machine enters the shift 40 While not shown in the above examples, multiplexing 
distributor state (SHD) 2342 (from the SELECT-TEST state) circuitry is provided to allow the circuits to be connected to 
state to shift digital data from an external tester into the scan the scan distributor, scan collector, and analog input and 
distributor. Next, the state machine enters the Convert state output signals while in test mode, and to functional inputs 
2344 to cause the DAC to convert the digital data from the (FI) and outputs (FO) during normal mode, similar to that 
scan distributor into an analog output. The analog output is 45 shown in FIG. 8. It is also important to note that, in the 
inspected by an external tester connected to the analog above examples, the circuits being tested are connected, in 
output pad/terminal. The state machine remains in the Con- this particular test mode, directly to the scan distributor and 
vert state long enough for the conversion to take place and scan collector circuits, and not through scan paths as pre- 
fer the tester to inspect the analog output, then enters the viously described in the testing of combinational logic. Also, 
SHD state to load the next digital input to be converted into 50 the internal routing of analog inputs and outputs between the 
analog output and inspected. This process repeats until all external tester contacting the integrated circuit pad and a 
digital input codes have been input to the DAC and con- terminal to an embedded DAC or ADC should be designed 
verted into analog outputs. The test can repeat with a carefully so that the analog signal is not significantly 
different sequence of digital inputs if desired. degraded or otherwise modified. 

In FIG. 23C-1, an integrated circuit 2350 includes an 55 In FIG. 24, for example, the direct routing scheme 

analog to digital converter (ADC) 2352 connected, in test described for the TPI and TCI signals in FIG. 17 can be used 

mode, to a scan collector circuit 2354 at its digital output and for routing the analog test input and output signals between 

to an integrated circuit pad or core terminal 2356 at its integrated circuit pads 2406, 2408 and terminals 2424, 2426 

analog input. The scan collector circuit is connected, in test of embedded mixed signal cores 1, 2, and 3 within the 

mode, to an integrated circuit pad or core terminal 2358. The 60 integrated circuit 2400, The analog multiplexer and demul- 

scan collector circuit 2354 provides digital output from the tiplexer circuitry, such as 2402, 2404, at the analog test input 

ADC and the analog input provides analog input to the ADC. and output terminals 2424, 2426 of the cores can be con- 

FIG. 23C-2 shows how the SELECT-TEST portion of the trolled by the integrated circuit's controller to allow either 

state diagram of FIG. 16 is programmed to operate when test or functional input and output, as described in FIG. 8. 

testing the ADC. 65 The analog multiplexer and demultiplexer circuitry may be 

While many types of ADC test sequences can be pro- designed for bi-directional operation using transmission 

grammed into the state machine, this test is programmed to gates, or for unidirectional operation using buffers. When 
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not used for output ting analog test signals, the test outputs 
of the demultiplexers 2404 are disabled from driving pad 
2408. Core 3 2410 includes a direct routing scheme that is 
hierarchical to allow further connection to core 3 A 2412 and 
core 3B 2414 within Core 3. Within Core 3, the operation of 5 
the analog multiplexers and demultiplexers 2416, 2418 of 
Core 3A and Core 3B are controlled by Core 3's controller, 
not the integrated circuit's controller. 

Also in FIG. 24, integrated circuit 2400 includes isolation 
switches 1 and 2 (IS1 and IS2) 2420, 2422 on the integrated 10 
circuit's analog input and output pads. In normal mode, the 
integrated circuit's controller closes IS1 and IS2, and in test 
mode, the integrated circuit's controller opens IS1 and IS2. 
Opening IS1 in test mode isolates the pad 2406 from 
functional inputs (FI) it may be connected to in normal is 
mode, which eliminates loading and prevents the analog test 
inputs from effecting the functional inputs (FI) of circuitry 
connected to the pad during normal integrated circuit opera- 
tion mode. Opening IS2 in test mode isolates the pad 2408 
from functional outputs (FO) it may be connected to in 20 
normal mode, which allows the analog test output from a 
selected core to drive out on the pad without opposition from 
functional outputs. While not shown, similar isolation 
switches exist within each embedded core. The isolation 
switches of each core are controlled by the core's controller. 25 

In regard to the testing of digital circuits, similar isolation, 
to that mentioned above, is provided for digital test input and 
output as described in FIG. 14Aof U.S. Pat. No. 5,606,566 
to Whetsel, the patent mentioned in regard to FIG. 2. 
According to the present invention as described above for 30 
analog test input and output isolation, digital test input and 
output isolation at the integrated circuit level is controlled by 
the integrated circuit's controller while digital test input and 
output isolation at the core level is controlled by the core's 
controller. 35 
Modifying the IEEE 1149.1 TAP for Use in Scan Distributor 
and Scan Collector Architectures 

The description of the scan distributor and scan collector 
architecture has shown how it can be used in an integrated 
circuit and within cores embedded within integrated circuits. 40 
Another test architecture that can be used in integrated 
circuits and within cores embedded within integrated cir- 
cuits is the IEEE 1149.1 Test Access Port and Boundary 
Scan Architecture. The following description illustrates how 
the scan distributor and scan collector architecture and the 45 
IEEE 1149.1 architecture can be designed to coexist within 
an integrated circuit or within cores embedded within inte- 
grated circuits. Of particular importance is the way the IEEE 
1149.1 architecture will be shown modified or improved to 
allow it to utilize the same test interface as is used by the 50 
scan distributor and scan collector architecture, and to use 
the same method of hierarchical connectivity used by the 
scan distributor and scan collector architecture. 

In FIG. 25 A, a conventional 1149.1 Test Access Port 
(TAP) 2500 comprises inputs and output for a test data input 55 
(TDI), test data output (TDO), test mode select (TMS), and 
test clock (TCK). The TAP also comprises a TAP controller 
state machine 2502, an instruction register 2504, a plurality 
of data registers 2506, multiplexer 1 (Muxl) 2508, and 
multiplexer 2 (Mux2) 2510. The TAP controller is connected 60 
to TMS and TCK, and responds to these signals to shift data 
through either the instruction register or a selected data 
register, from TDI to TDO. During instruction register shift 
operations, the TAP controller causes Mux2 to connect the 
output of the instruction register to TDO. During data 65 
register shift operations, the instruction loaded in the 
instruction register selects one of the data register outputs to 
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be output from Muxl, and the TAP controller controls Mux2 
to connect the output of Muxl to TDO. The structure and 
operation of the TAP, as it will be referred to hereafter, is 
widely understood. 

In FIG. 25B, a conventional TAP 2530, like the TAP 2500 
of FIG. 25A, is modified to allow it to coexist and operate 
with the scan distributor and scan collector controller of 
FIG. 15 A. The modifications include: (1) inserting a third 
multiplexer (Mux3) 2532 between the output of Mux2 2534 
and TDO, which corresponds to the multiplexer in FIG. 
15A; (2) providing a TDI1 input to Mux3, which corre- 
sponds to SDH of FIG. 15A; (3) providing a TDOl output 
from Mux2, which corresponds to SDOl of FIG. ISA; (4) 
providing instruction control to Mux3 to allow an instruction 
to select TDOl or TDI1 to be output to TDO, which 
corresponds with the control output from the test control 
register to the multiplexer of FIG. 15A; (5) providing a port 
enable input (PEI) signal to the TAP controller 2536 to 
enable or disable the TAP, which corresponds to the TEI 
signal to the test control state machine of FIG. 15A; and (6) 
providing a port enable output (PEO) signal from the TAP's 
instruction register 2538 to enable lower level TAPs, which 
corresponds to the TEO signal from the test control register 
of FIG. 15A. 

With these modifications, the TAP 2530 operates as the 
conventional TAP 2500 when enabled by PEI and while 
Mux3 is controlled to make a connection between Mux2 and 
TDO. When Mux3 is controlled by an instruction shifted 
into the instruction register 2538 to insert a scan path 
between TDOl and TDI1 into the TAPs TDI and TDO scan 
path, the TAP 2530 leaves the conventional mode of opera- 
tion and enters the new mode of operation made possible by 
the present invention. 

It is important to note that control of Mux3 2532 is only 
possible by performing an instruction scan operation. Data 
scan operations through the TAP 2530 cannot modify the 
control of Mux3. This is an advantage since it allows the 
scan path length adjustment capability provided by Mux3 to 
take place only in response to instruction scan operations, 
and not during data scan operations. Also, while Mux3 is 
shown existing within the TAP 2530, it could exist external 
of the TAP as well. If it were external of the TAP, it would 
still be connected, as shown in FIG. 25B, to the instruction 
register 2538 and TDOl and TDU signals. 

U.S. Pat. No. 4,872,169, previously mentioned in regard 
to FIG. 15 A, describes an adjustable length scan path 
architecture. In this patent, the scan path length is adjustable 
during each scan operation. The above mentioned method of 
using Mux3 2532 to adjust the scan path length only during 
instruction scan operations, and not during data scan 
operations, is novel over the mentioned patent. Also, the 
above improvement is novel over conventional TAPs, since 
conventional TAPs (FIG. 25 A) do not have a Mux3 to 
provide the hierarchical capability to link or unlink the TDI 
to TDO scan path of a lower level TAP to or from the TDI 
and TDO scan path of a higher level TAP. Furthermore, 
conventional TAPs do not provide the capability of output- 
ting PEO control from a higher level TAP to enable or 
disable the operation of a lower level TAP, such that when 
enabled the TDI to TDO scan path of the lower level TAP is 
included in the TDI to TDO scan path of the higher level 
TAP, and when disabled, the TDI to TDO scan path of the 
lower level TAP is excluded from the TDI to TDO scan path 
of the higher level TAP. 

IEEE TAP Design with Instruction Adjustable Scan Length 
It is important to note that while the focus of this 
description is the design of a TAP that can coexist within a 
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scan distributor and scan collector architecture, the way the 
TAP is modified in regard to FIG. 25B to hierarchically 
insert or delete a lower level TAP scan path into and from a 
higher level TAP scan path is important independent of the 
scan distributor and scan collector architecture. This inven- 5 
tion includes the modifications of the conventional TAP as 
depicted in FIG. 25B and the above mentioned capability to 
only adjust the scan path length during instruction scan 
operations. 

The ability to enable or disable a TAP using a signal like 10 
PEI is known. A known example of how a TAP may be 
enabled or disabled is described in a paper entitled "An 
IEEE 1149.1 Based Test Access Architecture for integrated 
circuits with Embedded Cores" by Whetsel, published in the 
1997 IEEE International Test Conference proceedings. 

With the modifications described and shown in FIG. 25B, 
the TAP is seen to provide the same signal types as that seen 
in the controller of FIG. 15 A. For example in comparing 
FIGS. 25B and 15A it is seen that PEI relates to TEI, PEO 
relates to TEO, TDOl relates to SDOl, TDI1 relates to 
SDI1, TMS relates to TPI, TCK relates to TCI, TDI relates 
to SDI, and TOO relates to SDO. Further it is seen that the 
operation of the TAP of FIG. 25B and controller of FIG. 15A 
is similar. For example, (1) the TAP is enabled and disabled 



circuit to the second node of the second port, and from the 
second node of the second port to the second node of the first 
port. A circuit connected to the second port is disabled from 
communicating data with the first port if the enable output 
from the instruction register of the first port has been set low 
in response to an instruction register communication. In this 
case, communication occurs only from the first node of the 
first port, through the instruction or data register of the first 
port, and to the second node of the first port. 

If the circuit connected to the second port of the data path 
length adjustment circuit described above is another data 
path length adjustment circuit, it can be further connected at 
its second port to another circuit, which may also be a data 
path length adjustment circuit, and so on. This concept of 
is adjusting the length of a data communication by communi- 
cation to an instruction register is not limited to test data 
communication applications. It could be used in functional 
data communication applications as well. It is also indepen- 
dent of the physical implementation of the data path length 
adjustment circuit, which could be realized as a sub-circuit 
within an integrated circuit or core, or a device for use on a 
board or MCM. Also, while the data path length adjustment 
circuit has been described as having singular first and second 
nodes at the first and second ports, a plurality of first and 



20 



by the PEI signal, as the controller is enabled and disabled 25 second nodes on each of the first and second ports is possible 

by the TEI signal, (2) the TAP shifts data from TDI to TOO to support data path length adjustment of parallel data buses 

in response to TMS and TCK, as the controller shifts data as well. Furthermore, the connectivity arrangement provided 

from SDI to SDO in response to TPI and TCI, (3) the TAP by the data path length adjustment circuit could be altered 

is initialized at reset to exclude a TDOl to TOU scan path from the example given without departing from the spirit 

from the TDI and TOO scan path, as the controller is 30 and scope of the present invention, 

initialized at reset to exclude a SDOl to SDI1 scan path from U.S. Pat. Nos. 4,872,169 and 5,056,093, both by Whetsel, 

the SDI and SDO scan path, (4) the TAP's instruction adjust the length of scan paths. U.S. Pat. No. 4,872,169 

register can be loaded with control to include a TDOl to adjusts the length of scan paths by communication to a 

TDI1 scan path in the TDI and TOO scan path, as the control bit contained within the scan path. U.S. Pat. No. 

controller's test control register can be loaded with control 35 5,056,193 adjusts the length of scan paths by communica- 

to include a SDOl to SDI1 scan path is the SDI and SDO tion to a data register, following a first communication to an 

scan path, and (5) the TAP's instruction register can be instruction register. The data path length adjustment method 

loaded with control to exclude a TDOl and TO II scan path described above occurs in response to only instruction 

from the TDI and TOO scan path, as the controller's test register communication, 

control register can be loaded with control to exclude a 40 Operating State Machines with Shared I/O 



SDOl and SDI1 scan path from the SDI and SDO scan path. 
A conventional TAP (i.e. the TAP of FIG. 25A) instruction 
register is designed to include an update register as shown in 
FIG. 15B to prevent its control outputs from changing 
during shift operations. 

Instruction Based Data Path Length Adjustment 

The circuit of FIG. 25B can be generally viewed as a data 
path length adjustment circuit having first and second ports. 
The first port has an enable input (PEI) and first (TDI) and 



In FIG. 26, the TAP of FIG. 25B and controller of FIG. 
ISA are connected to allow both to coexists together in an 
integrated circuit or core. In FIG, 26, circuitry 2600 includes 
a test access port or TAP 2602 and a controller 2604. TAP 
45 2602 is like TAP 2530 and controller 2604 is like controller 
1500. The similarities between the TAP and controller allow 
both to share many of the same signals. For example, the 
TMS and TPI control signals can be provided by a single 
shared TMS/TPI signal, the TCK and TCI clock signals can 



second (TOO) nodes for communicating data. The second 50 be provided by a single shared TCK/TCI signal, the TDI and 

port has an enable output (PEO) and first (TDOl) and SDI data input signals can be provided by a single shared 

second (TDI1) nodes for communicating data. The first port TDI/SDI signal, the TOO and SDO data output signals can 

is enabled to communicate data between its first and second be provided by a single shared TOO/SDO signal, the TDOl 

nodes if the enable input is high, and is disabled from and SDOl data output signals can be provided by a single 

communicating data if the enable input is low. The data 55 shared TOOl/SDOl signal, and the TOU and SDI1 data 



communicated from the first and second nodes of the first 
port passes through either an instruction register or a data 
register contained within the first port. The enable output of 
the second port comes from the instruction register of the 
first port. 

A circuit connected to the second port is enabled to 
communicate data with the first port if the enable output 
from the instruction register of the first port has been set high 
in response to an instruction register communication. In this 



input signals can be provided by a single shared TDI1/SDI1 
signal. The advantage of sharing these signals is that it 
reduces the number of integrated circuit pads, such as 2610, 
required to access the TAP or controller at the integrated 
60 circuit level, and also reduces wiring interconnect between 
integrated circuit level TAPs and controllers and embedded 
core level TAPs and controllers. 

In FIG. 26, the PEI and TEI signals are not shared. This 
allows the TAP and controller to be enabled or disabled 



case, communication occurs from the first node of the first 65 individually. If neither the TAP or controller is being 
port, through the instruction or data register of the first port accessed, the PEI and TEI signals will be set to disable them, 
to the first node of the second port, through the connected If the TAP is being accessed, the PEI signal will enable the 
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TAP and the TEI signal will disable the controller. If the 
controller is being accessed, the TEI signal will enable the 
controller and the PEI signal will disable the TAP. The PEO 
and TEO outputs from the TAP and controller respectively, 
are also not shared to allow individual control outputs for 
setting the PEI and TEI inputs of embedded, lower level 
TAPs and controllers. When the TAP is enabled by PEI, 
buffers are enabled to allow the TAP to output on the shared 
TDO/SDO and TDOl/SDOl outputs. Likewise, when the 
controller is enabled by TEI, buffers are enabled to allow the 
controller to output on the shared TDO/SDO and TDOl/ 
SDOl output. 

It is important to see in the arrangement of FIG. 26 that 
two state machines, i.e. TAP and controller, are connected 
together using shared inputs and outputs. It is further seen 
that each state machine can be individually enabled to 
operate using the shared inputs and outputs to perform a 
function. 

Architecture Supporting Hierarchically Arranged IEEE 
1149.1 TAPs 

When the controller is enabled and the TAP is disabled, 
the controller operates as if the TAP were not present and in 
all the arrangements previously described in FIGS. 17, 18, 
and 19. For example, when the controller is enabled and the 
TAP is disabled, the controller can operate in the hierarchical 
arrangement of FIG. 17 to enable and connect up with lower 
level controllers. When the TAP is enabled and the controller 
is disabled, the TAP operates in a very similar way as 
previously described for the controllers in FIG. 17. For 
example, if the shared TAP and controller signals of FIG. 26 
were substituted for the controller signals in FIG. 17, and if 
"integrated circuit TAP", "Core 1 TAP within integrated 
circuit", and "Core 2 TAP within Core 1" were substituted 
for "integrated circuit Controller", "Core 1 Controller within 
integrated circuit", and "Core 2 Controller within Core 1", 
respectively, the process and description of hierarchically 
selecting a lower level TAP by scanning a higher level TAP 
would follow closely that given for the controller. 
Summarizing, the process would be to scan the instruction 
register of the highest level TAP to enable a lower level TAP, 
then scanning through both instruction registers of both 
TAPs to continue enabling further lower level TAPs or to 
load a test instruction to execute in both TAPs. 

Known operations performed by TAPs in integrated cir- 
cuits and cores include; (1) testing the interconnects between 
plural integrated circuits on a board and plural cores within 
an integrated circuit, (2) testing circuitry contained within an 
integrated circuit or core, and (3) executing emulation and 
debug functions of circuitry contained within an integrated 
circuit or core. The ability to hierarchically access the TAPs 
within integrated circuits and cores, as shown in FIG. 17, to 
perform these types of operations is therefore an important 
aspect of the present invention. 

I claim: 

1. A controller for use in a scan distributor and scan 
collector architecture system comprising: 

A. a test control register having an input for receiving a 
serial data input (SDI) and inputs for receiving control, 
the test control register having an output for providing 
a serial data output 1 (SDOl), and a control bus output 
that provides control; 

B. a test control state machine, the state machine having 
inputs for receiving a test protocol input (TPI), a test 
clock input (TCI), a test enable input (TEI), and control 
inputs from the test control register control bus output, 
the state machine also having outputs for providing a 
shift distributor and collector output (SHDC), a capture 



15 



collector output (CPC), a shift parallel scan path output 
(SHPSP), and a capture parallel scan path output 
(CPPSP), the state machine also having control outputs 
that are output to the test control register; and 
5 C. a multiplexer having control inputs from the test 
control register control bus output, a serial data output 
1 (SDOl) input from the test control register, and a 
serial data input 1 (SDI1), and the multiplexer having 
a serial data output (SDO). 
10 2. The controller of claim 1 including integrated circuit 

pads connected to the controller that carry the input and 

output signals to and from the controller. 

3. The controller of claim 1 including core terminals that 
are connected to the controller and that carry the input and 
output signals to and from the controller. 

4. The controller of claim 1 including scan distributor, 
scan collector, and scan path circuits that are connected to 
the SHDC, CPC, SHPSP, and CPPSP signals output from the 
controller. 

5. The controller of claim 1 including a lower level 
20 controller and in which the test control register includes an 

output for providing a test enable output (TEO) to the lower 
level controller. 

6. The controller of claim 1 in which the test control 
register includes an update register and a shift register, the 

25 shift register having a serial data input and a serial data 
output and receiving a reset and a clock input, the shift 
register connecting to the update register and the update 
register producing the control bus of signals. 

7. The controller of claim 1 including additional control- 
30 lers arranged in a hierarchy with the higher controller 

controlling the lower controller. 

8. The controller of claim 1 including additional control- 
lers arranged with one higher controller controlling other 
controllers through a multiplexer. 

9. An integrated circuit including plural controllers of 
claim 1 in which each controller acts independent of one 
another. 

10. An integrated circuit comprising; 
a first state machine having a mode input, a clock input, 

and an enable input, said first state machine enabled, by 
a first logic state on the enable input, to transition 
through states in response to the mode and clock inputs, 
and disabled, by a second logic state on the enable 
input, from transitioning through states in response to 
the mode and clock inputs; 
a second state machine having a mode input, a clock 
input, and an enable input, said second state machine 
enabled, by a first logic state on the enable input, to 
transition through states in response to the mode and 
clock inputs, and disabled, by a second logic state on 
the enable input, from transitioning through states in 
response to the mode and clock inputs; 
a first connection formed between the mode inputs of the 
55 first and second state machines and a first input pad of 
the integrated circuit; 
a second connection formed between the clock inputs of 
the first and second state machines and a second input 
pad of the integrated circuit; 
60 a third connection formed between the enable input of the 
first state machine and a third input pad of the inte- 
grated circuit; and 
a fourth connection formed between the enable input of 
the second state machine and a fourth input pad of the 
65 integrated circuit. 

11. A test access port circuit formed on an integrated 
circuit, said test access port circuit comprising; 
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an instruction register having a serial input connected to 
a first test data input terminal; 

a plurality of data registers each having serial inputs 
connected to the first test data input terminal; 

a state machine having inputs connected to a test clock 5 
input terminal and a test mode select input terminal; 

a first multiplexer having data inputs, each data input 
connected to a serial output of one of said plurality of 
data registers; 1Q 

a second multiplexer having first and second data inputs, 
said first data input connected to the output of the first 
multiplexer and said second data input connected to a 
serial output from the instruction register, said second 
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multiplexer having a data output connected to a first 
test data output terminal; 
a third multiplexer having first and second data inputs, 
said first data input connected to the output of the 
second multiplexer and said second data input con- 
nected to a second test data input terminal; said third 
multiplexer having a data output connected to a second 
test data output terminal. 
12. The test access port circuit of claim 11 wherein the 
first and third multiplexers receive control input from the 
instruction register, and the second multiplexer receives 
control input from the state machine. 



03/19/2004, EAST Version: 1.4.1 



